REPORT  DOCUMENTATION  PAGE 


Form  Approved 
OMB  No.  07040188 


Public  reporting  burden  for  this  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and  maintaining  the  data  needed,  and  completing  and 
reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information,  including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for 
Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington,  VA  222024302,  and  to  the  Office  of  Management  and  Budget,  Paperwork  Reduction  Project  (0704-0188),  Washington,  DC  20503. 


1.  AGENCY  USE  ONLY  (Leave blank) 


3.  REPORT  TYPE  AND  DATES  COVERED 


4.  TITLE  AND  SUBTITLE 

AN  EMPIRICAL  EXAMINATION  OF  THE  ROBUSTNESS  OF  ARBITRAGE 
FACTORS 


6.  AUTHOR(S) 

RANDALL  BARRY  HOWARD 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 


University  of  Georgia 


I.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 

97-033D 


9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

THE  DEPARTMENT  OF  THE  AIR  FORCE 
AFTT/CIA,  BLDG  125 
2950  P  STREET 
WPAFB  OH  45433 


10.  SPONSORING/MONITORING 
AGENCY  REPORT  NUMBER 


12a.  DISTRIBUTION  AVAILABILITY  STATEMENT 

Unlimited  Distribution 

In  Accordance  With  AFI  35-205/AFIT  Sup  1 


13.  ABSTRACT  (Maximum 200  words) 


12b.  DISTRIBUTION  CODE 


19971215  156 


14.  SUBJECT  TERMS 


DTIC  QUALITY  INSPECT ED  4 


15.  NUMBER  OF  PAGES 


16.  PRICE  CODE 


17.  SECURITY  CLASSIFICATION 
OF  REPORT 


18.  SECURITY  CLASSIFICATION 
OF  THIS  PAGE 


19.  SECURITY  CLASSIFICATION 
OF  ABSTRACT 


120.  LIMITATION  OF  ABSTRACT 


Standard  Form  298  (Rev.  2-89  EG} 

Prescribed  by  ANSI  Std.  239.16 

Designed  using  Perform  Pro,  WHSIDI0R,  Oct  94 


AN  EMPIRICAL  EXAMINATION  OF  THE  ROBUSTNESS  OF 
ARBITRAGE  FACTORS 


by 

RANDALL  BARRY  HOWARD 

B.S.,  Birmingham  Southern  College,  1986 
M.S.,  Air  Force  Institute  of  Technology,  1992 


A  Dissertation  Submitted  to  the  Graduate  Faculty 
of  The  University  of  Georgia  in  Partial  Fulfillment 

of  the 

Requirements  for  the  Degree 


DOCTOR  OF  PHILOSOPHY 


ATHENS,  GEORGIA 
1997 


AN  EMPIRICAL  EXAMINATION  OF  THE  ROBUSTNESS  OF 
ARBITRAGE  FACTORS 


by 

RANDALL  BARRY  HOWARD 


Approved: 


Approved: 


Date 


This  Dissertation  is  Dedicated  to  My  Family: 
Karen,  Ryan,  and  Allison 


ACKNOWLEDGEMENTS 


This  effort  would  not  have  been  successful  without  the  help  of  many  people.  I 
wish  to  thank  Dr.  Marc  Lipson,  my  advisor,  for  his  constant  support  and  technical 
guidance.  Dr.  William  Megginson  deserves  much  recognition  for  his  support  in  this 
and  other  endeavors.  As  graduate  advisor  and  a  key  member  of  my  committee,  Dr. 
Megginson  was  always  willing  to  help.  I  would  like  to  thank  the  other  members  of 
my  committee,  Dr.  Arthur  Snow,  Dr.  William  Lastrapes,  and  Dr.  Shane  Corwin,  for 
their  knowledgeable  comments  and  direction.  I  could  not  have  asked  for  a  more 
supportive  committee.  Special  thanks  also  to  Elaine  Dunbar  -  without  her  knowledge 
of  "the  official  way"  to  write  a  dissertation,  I  never  would  have  finished! 

Of  course,  none  of  this  would  have  been  possible  without  the  support  of  the  United 
State  Air  Force.  They  have  provided  me  with  educational  opportunities  unparalleled 
by  any  other  employer.  I  am  especially  grateful  to  the  office  of  the  Deputy  Assistant 
Secretary  of  the  Air  Force  for  Cost  and  Economics  and  the  Air  Force  Military 
Personnel  Center  for  selecting  me  for  this  assignment.  Special  thanks  also  to  the 
Department  of  Banking  and  Finance  for  allowing  me  to  attend  their  program  in  my 
somewhat  unique  situation  as  a  military  officer. 

I  wish  to  thank  my  parents  for  their  life-long  encouragement  to  succeed  in  all  my 
endeavors.  Finally,  and  most  importantly,  I  wish  to  thank  my  family  who  has 
supported  me  throughout  this  entire  program  -  anything  is  possible  when  you  have  a 
family  like  mine  behind  you.  During  the  past  years  I  have  been  a  student  more  often 
than  a  husband  and  father.  I  look  forward  to  spending  much  more  time  with  my 
family  in  the  coming  months  and  years. 

Randall  Barry  Howard 


IV 


TABLE  OF  CONTENTS 


Page 

ACKNOWLEDGEMENTS  .  iv 

CHAPTER 

1  INTRODUCTION .  1 

2  ARBITRAGE  PRICING  THEORY . , .  6 

Initial  Development .  6 

Factor  Specification .  10 

Theoretical  Extensions  of  the  APT .  14 

Testability  of  the  APT .  18 

Summary  .  19 

3  REVIEW  OF  THE  LITERATURE .  20 

Testing  the  APT .  20 

Factor  Analysis  .  22 

Asymptotic  Principal  Components .  31 

Pre-Specified  Factors .  38 

Comparing  the  Various  Techniques  .  47 

Various  Other  Techniques  and  Contributions  .  51 

Summary  .  52 

4  TESTABLE  HYPOTHESES  AND  METHODOLOGY .  54 

Testable  Hypotheses  .  55 

Methodology .  57 

5  DATA  SELECTION  AND  SOURCES .  64 

Security  Returns .  64 


v 


vi 

Macroeconomic  Variables .  66 

Summary  .  70 

6  RESULTS .  71 

Hypothesis  #1  -  Equivalence  of  Factors .  71 

Hypothesis  #2  -  Risk  Premiums  .  81 

7  SUMMARY  AND  CONCLUSIONS . 87 

BIBLIOGRAPHY .  93 

APPENDIX  A  -  TABLES .  105 

APPENDIX  B  -  FIGURES  .  141 

APPENDIX  C  -  INDUSTRY  GROUPINGS .  145 


CHAPTER  1 


INTRODUCTION 

After  thirty  years  of  vigorous  research,  there  is  still  little  agreement  in  the 
field  of  asset  pricing  theory.  Shanken  and  Smith  (1996)  sum  up  the  vast  amount  of 
empirical  research  on  asset  pricing  models  by  saying,  "Although  we  have  learned 
much  about  the  cross-sectional  and  time-series  properties  of  returns  and  have 
developed  sophisticated  statistical  methods  to  increase  the  power  of  the  tests, 
numerous  unanswered  questions  remain. "  Two  of  the  most  fundamental,  yet 
unanswered,  questions  are:  How  many  factors  are  there?  and  What  are  those  factors? 

The  two  primary  equilibrium,  expected  return  models  are  the  Capital  Asset 
Pricing  Model  (CAPM),  developed  almost  simultaneously  by  Sharpe  (1964),  Lintner 
(1965),  and  Mossin  (1966),  and  the  Arbitrage  Pricing  Theory  (APT),  introduced  by 
Ross  (1976,  1977).  The  CAPM  is  a  one  factor  model  that  states  that  the  equilibrium 
rate  of  return  on  any  asset  is  a  linear  function  of  the  asset's  covariance  with  the 
market  portfolio.  The  APT,  on  the  other  hand,  is  a  multifactor  model. 

Although,  while  initially  the  CAPM  received  widespread  support,  it  has 
subsequently  been  rejected  in  numerous  papers.  For  example,  Roll  (1977)  provides 
his  seminal  critique  concerning  the  testability  of  the  CAPM.  The  finance  literature 
has  documented  numerous  anomalies  in  return  series  that  cannot  be  explained  by  the 
CAPM.1  Perhaps  the  strongest  evidence  against  the  CAPM  is  Fama  and  French's 


!The  three  main  anomalies  are:  the  day  of  the  week  effect,  the  small  firm  effect, 
and  the  January  effect.  See  for  example  Banz  (1981),  Basu  (1983),  Keim  (1983),  and 
Bhandari  (1988).  Megginson  (1997)  provides  an  excellent  review  of  these  anomalies 
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(1992)  conclusion  that  beta  has  little  significant  power  in  explaining  the  cross- 
sectional  variation  in  asset  returns.  Despite  this  evidence,  the  CAPM  remains  the 
preferred  model  of  choice  for  many  financial  practitioners  and  by  academicians  for 
classroom  use  in  undergraduate  and  MBA  programs. 

Jagannathan  and  Wang  (1996)  suggest,  as  do  many  others,  that  the  CAPM 
survives  for  several  reasons.  First,  most  of  the  other  asset  pricing  models  do  not  fare 
much  better.  Second,  the  theory  behind  the  CAPM  has  an  intuitive  appeal  that  the 
other  models  lack.  More  fundamentally,  there  is  some  debate  as  to  whether  or  not 
the  evidence  against  the  CAPM  is  economically  significant.2 

Recently,  numerous  papers  have  argued  that  multifactor  models  may  provide  a 
better  description  of  average  returns.  Among  the  multifactor  models,  one  can  argue 
that  Ross's  Arbitrage  Pricing  Theory  (APT)  stands  out  as  the  heir  apparent  to  the 
CAPM.3  The  APT  is  attractive  for  several  reasons.  It  is  based  on  relatively  weak 
assumptions  and  allows  for  multiple  sources  of  systematic  risk.  The  market  portfolio 
plays  no  special  role  and  the  APT  is  valid  on  subsets  of  assets. 

Perhaps  the  biggest  obstacle  associated  with  the  APT  is  it's  silence  on  the 
number  of  priced  factors  required  and  what  they  are.  If  the  factors  were  actually 
known,  then  the  APT  would  have  the  same  intuitive  appeal  of  the  CAPM.  Investors 


in  his  Chapter  3,  page  120. 

2Kothari,  Shanken,  and  Sloan  [KSS]  (1995)  suggest  that  many  of  the  adverse 
findings  depend  on  how  one  interprets  the  usual  statistical  tests.  Amihund, 
Christensen,  and  Mendelson  (1992)  argue  that  return  data  is  too  noisy  to  reject  the 
CAPM.  Black  (1993)  argues  that  some  of  the  rejections  are  a  sample  period  effect. 
KSS  (1995)  suggests  that  Fama  and  French's  (1992)  conclusion  on  beta  was  due  to 
bias  in  Compustat  data. 

3Although  the  APT  is  just  one  example  of  a  multifactor  model  the  terms  are  often 
used  interchangeably  in  the  literature. 
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are  only  willing  to  pay  for  risk  that  cannot  be  diversified  away  and  there  are  several 
different  priced  factors. 

The  empirical  and  theoretical  literature  on  the  APT  is  extensive.  The  general 
consensus  is  that  the  APT  outperforms  the  CAPM.  Specifically,  it  explains  most,  but 
not  all,  of  the  anomalies  associated  with  the  CAPM,  it  explains  a  larger  percentage  of 
the  cross-sectional  variation  in  returns,  and  it  has  lower  pricing  errors  associated  with 
it.  However,  there  is  little  agreement  as  to  the  number  and  nature  of  significant 
factors.  For  example,  numerous  researchers  have  shown  that  the  number  of 
significant  factors  is  dependent  on  sample  size,  sorting  schemes,  time  period,  etc. 

One  possible  explanation  for  these  observations  is  that  the  APT  performs  better 
because  it  is  taking  advantage  of  sample  specific  information.  Although  several 
papers  have  suggested  this  possibility  (either  directly  or  indirectly),  there  has  been 
little  empirical  work  in  the  area. 

This  dissertation  seeks  to  answer  a  critical  question  regarding  the  APT.  Are 
the  returns  actually  generated  by  a  k  factor  model,  as  the  APT  hypothesizes,  or  are 
we  simply  improving  the  explanatory  power  for  a  unique  sample  by  adding  extra 
independent  variables?  It  could  be  that  the  APT  fares  better  than  the  CAPM  because 
the  additional  factors  of  the  APT  exploit  information  that  is  unique  to  the  sample  at 
hand,  and  are  not  really  representative  of  a  true,  economy  wide  linear  factor 
structure. 

Previous  tests  of  the  APT  concentrate  on  the  significance  of  factors  "in- 
sample".  In  contrast,  I  focus  on  whether  or  not  the  factors  are  pervasive.  To 
illustrate  the  difference  consider  a  portfolio  consisting  only  of  securities  related  to  the 
mining  of  precious  metals.  Clearly  one  would  not  be  surprised  to  find  some  measure 
of  inflation  is  a  "priced"  or  significant  factor  for  this  sample.  This  does  not  mean 
that  all  assets  will  be  affected  by  inflation.  I  consider  a  factor  to  be  pervasive  if  it 
affects  subsets  of  the  entire  population. 
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If  security  returns  are  generated  by  a  A:  factor  model,  the  factors  extracted 
from  different  groups  of  securities  (for  a  given  time  period),  should  be  asymptotically 
equal.  If  the  factors  are  significantly  different,  then  it  must  be  that  either  the 
extraction  technique  is  not  valid  or  different  groups  of  securities  are  affected  by 
different  factors.  The  latter  of  which  violates  a  key  assumption  of  the  APT. 

Although  the  APT  is  valid  on  subsets  (i.e.,  it  does  not  require  the  entire  universe  of 
securities  to  be  used),  one  would  expect  the  factors  to  be  the  same  if  the  samples  are 
sufficiently  large  and  drawn  from  the  same  population.  Given  that  the  factors  are 
equal,  then  if  the  APT  is  correct,  the  market  price  of  risk  for  each  of  the  factors 
should  also  be  the  same. 

Work  in  this  area  to  date  includes  Brown  and  Weinstein  (1983),  Cho  (1984), 
and  Conway  and  Reinganum  (1988).  Brown  and  Weinstein  attempt  to  compare 
factors  extracted  from  distinct  portfolios.  Cho  extracts  only  those  factors  that  are 
common  among  portfolios.  Conway  and  Reinganum  use  the  technique  of  cross 
validation  to  determine  if  the  factors  contribute  in  explaining  out-of-sample  returns. 
Unfortunately,  as  will  be  discussed  in  the  following  chapters,  these  papers  rely  on 
small  portfolios  and  questionable  methodologies.  Using  extensive  simulation  data,  I 
hope  to  demonstrate  a  fundamentally  sound  technique  for  directly  comparing  factors 
extracted  from  large  portfolios  of  security  returns. 

Using  factors  extracted  via  Connor  and  Koraczyk's  (1986,  1988)  asymptotic 
principal  components  technique,  I  hope  to  show  that  if  returns  are  generated  by  a  A: 
factor  process  the  first  k  factors  will  be  the  same  when  extracted  from  different 
portfolios.  I  then  will  analyze  large  portfolios  of  security  returns  to  see  if  they  are 
generated  by  k  pervasive  factors.  As  an  additional  test  of  the  APT,  I  will  look  at  the 
estimated  "price"  of  the  factors  when  using  different  portfolios.  As  will  be  discussed 
in  the  following  sections,  macroeconomic  proxies  will  be  used  for  the  factors  when 
testing  if  the  price  is  the  same  across  portfolios.  This  test  might  also  be  useful  in 


5 


determining  which  macroeconomic  variables  are  the  best  proxies  for  the  true  but 
unknown  factors. 

This  dissertation  is  organized  as  follows:  Chapter  2  provides  a  brief 
theoretical  review  of  the  APT  and  provides  some  basic  notation.  A  review  of  the 
pertinent  literature  is  found  in  Chapter  3.  Next,  in  Chapter  4, 1  discuss  the  testable 
hypotheses  and  methodologies.  The  various  sample  selection  criteria  and  data  sources 
are  discussed  in  Chapter  5.  Chapter  6  presents  the  empirical  results  of  the  study. 
Finally,  Chapter  7  provides  a  summary  of  the  results  and  conclusions,  and  offers 
some  recommendations  for  future  research. 


CHAPTER  2 


ARBITRAGE  PRICING  THEORY 

The  introduction  of  the  Arbitrage  Pricing  Theory  (APT)  by  Ross  (1976,  1977) 
is  considered  by  many  academicians  as  one  of  the  most  significant  developments  in 
Finance.  Its  introduction  has  generated  a  large  body  of  both  theoretical  and  empirical 
literature.  This  chapter  starts  with  a  review  of  Ross's  original  development  of  the 
APT.  Next,  I  discuss  the  major  approaches  used  to  implement  tests  of  the  APT. 
Then,  I  discuss  the  major  theoretical  extensions  of  the  APT.  Finally,  I  examine  the 
testability  of  the  APT.  The  following  chapter  reviews  the  pertinent  empirical 
literature,  focusing  on  those  papers  that  examine  the  number  or  stability  of  the  priced 
factors. 

2.1  Initial  Development 

This  section  provides  a  brief  review  of  Ross's  (1976,  1977)  original 
development  of  the  APT.  The  APT  assumes  the  normal  perfectly  competitive  and 
ffictionless  market  assumptions  (e.g.,  no  taxes  or  transaction  costs,  and  infinitely 
divisible  assets).  It  assumes  that  individual  investors  have  homogeneous  expectations 
about  asset  returns  and  that  the  random  returns  for  all  assets  are  generated  by  a  linear 
k-factor  model  of  the  following  form:4 


4As  in  Ross's  original  development  -  no  time  subscript  is  used.  In  fact,  the  APT 
is  valid  even  in  an  intertemporal  setting. 
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fi  =  Ei +  bu  A  +  -  +  K  h  +  -  +  g« 
Ei  +  bif+  ij. 


(2.1) 


where 

f  =  the  rate  of  return  on  asset  i 

^  =  the  expected  return  on  asset  i 

bjk  =  the  sensitivity  of  asset  i  to  the  k!11  common  factor 

f  _  the  mean  zero,  k*  factor  common  to  the  return  on  all  assets. 

J  k 

g  =  the  mean  zero,  idiosyncratic  return  on  asset  i 
Or,  in  a  more  convenient  matrix  notation: 

r  =  E  +  Bf  +  e  (2‘2) 

where  r,  E  and  e  are  n  x  1 ,  B  is  n  x  k,  and  f  is  k  x  1 . 

The  main  additional  assumptions  involve  those  necessary  for  a  diversification 
argument,  which  requires  that  one  is  able  to  completely  diversify  away  the 
idiosyncratic  risk.  This  requires  that  n  be  much  larger  than  k  and  that  the  noise 
vector,  g ,  be  sufficiently  independent  so  that  idiosyncratic  risk  can  be  eliminated  in 
large  portfolios.  A  sufficient,  but  not  necessary,  condition  for  this  requirement  is  that 
the  g.'s  be  mutually  stochastically  uncorrelated,  and  have  finite  variances.  This  is  the 

assumption  originally  used  by  Ross,  although  as  we  will  see  later,  he  notes  that  it  can 
be  weakened.  This  assumption,  that  the  covariance  matrix  of  the  idiosyncratic  return 
component  is  diagonal,  gives  us  an  exact  factor  structure. 

The  APT,  as  the  name  implies,  is  based  on  a  simple  no  arbitrage  argument. 
Any  portfolio  which  uses  no  wealth  and  involves  no  risk  must,  on  average,  yield  zero 
return.  The  development  proceeds  along  the  following  lines.  Let  r\  represent  an 
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n  x  1  vector  representing  the  wealth  invested  in  each  asset  and  let  ^  be  an  n  x  1 
constant  vector  of  ones.  Design  a  portfolio  that  requires  zero  investment  (i.e.,  the 
total  long  position  equals  the  total  short  position),  giving  us: 

r\'\n  =  0  (2-3) 

We  will  also  assume  that  the  portfolio  is  well  diversified 

(i.e.,  r|.  ~  —  V.  =  1,  •••,»).  With  our  earlier  assumptions  then,  the  law  of  large 
1  n  1 

numbers  will  allow  us  to  diversify  away  the  idiosyncratic  risk.5  The  return  on  this 
portfolio,  Rp,  is  given  by  multiplying  equation  (2.2)  by  r )/: 

Rp  =  T\'r  =  r\'E  +  (t)  /B)f  +  (2.4) 

=  r\'E  +  (n  'B)f 

Further  assume  that  we  engineer  the  weights  of  the  portfolio  so  that  it  has  zero 
systematic  risk,  giving  us 

=  0  V  i  =  1,  •••,  k  (2.5) 

or  ri'B  =  0 

and  then  from  equation  (2.4)  we  have 

RP  =  n'E  (2.6) 

Now,  the  random  return  is  a  certainty  and  to  rule  out  arbitrage,  must  be  equal  to 
zero,  giving  us 

r|/E  =  0  (2.7) 

Therefore,  if  r|  is  orthogonal  to  1„  [equation  (2.3)]  and  r\  is  orthogonal  to  B  [equation 
(2.5)],  r|  must  also  be  orthogonal  to  E  [equation  (2.7)].  From  linear  algebra,  this 


5In  the  strict  sense  we  have  hm  rj/e  -  0  ,  for  now  we  will  assume  r\'e  =  0, 
and  later  we  will  adjust  our  results  accordingly. 
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will  only  occur  if  E  can  be  written  as  a  linear  combination  of  and  the  b  ;'s.  This 
gives  us  the  ex-ante  expected  return  form  of  the  APT 

Ei  A0  +  Xxbu  +  •••  +  Xk  bik  (2.8) 

*  A,0  +  A  bt 

Or  in  matrix  notation 

E  In  +  XB  (2.9) 

Equations  (2.8)  and  (2.9)  are  not  strict  equalities,  since  r^g  =  o  only  in  the 

limit  as  n  approaches  oo.  Eq  can  be  interpreted  as  the  return  on  an  asset  with  zero 
systematic  risk  (i.e.,  a  risk-free  rate  or  the  return  on  a  zero-beta  asset).  As  for  the 
Vs,  consider  an  asset  with  unit  risk  to  factor  j  (i.e.,  by  =  1)  and  zero  risk  to  all  other 
factors  (i.e.,  bik  =  0  V  k  *  j).  From  equation  (2.9),  the  expected  return  on  this  asset 
will  be 

E^Ao  +  Xj  (2.10) 

giving  us 

A-j  «  Ej  -  Aq  (2.11) 

so  the  Vs  are,  in  essence,  risk  premium's.  Admati  and  Pfleiderer  (1985)  demonstrate 
that  the  Vs  can  be  interpreted  as  excess  returns  on  portfolios  perfectly  correlated  with 
the  factors  if  a  riskless  asset  and  such  portfolio's  exist. 

In  summing  up  this  development  of  the  APT,  it  is  worth  highlighting  some  of 
the  strengths  of  the  APT.  First,  the  APT  makes  no  assumptions  about  the  underlying 
distribution  of  security  returns.  Nor  does  it  make  any  assumptions  about  individual 
investors  utility  functions  (other  than  nonsatiation).  In  contrast,  the  CAPM  requires 
either  normally  distributed  returns  or  a  quadratic  utility  function  for  investors;  neither 
of  which  are  very  appealing  assumptions.  Finally,  the  market  portfolio  plays  no 
special  role  in  the  APT  and  the  APT  is  valid  on  subsets.  These  strengths  suggest  that 
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the  APT  is  a  good  alternative  to  the  CAPM.6  Unfortunately,  the  APT  is  not  without 
serious  flaws  of  it's  own.  Certainly  the  most  fundamental  problem  is  the  APT's 
silence  on  the  number  and  nature  of  the  factors.  This  weakness  is  important  for  two 
reasons.  First,  as  Megginson  (1997)  points  out,  it  makes  it  almost  impossible  to 
operationalize  the  model  in  a  corporate  finance  setting.  Secondly,  it  causes  a 
multitude  of  econometric  problems  which  will  be  discussed  in  detail  in  the  following 
chapter.  Basically,  before  any  tests  can  be  performed  in  an  APT  setting,  we  have  to 
identify  the  factors,  their  associated  loadings,  and  their  prices. 

2.2  Factor  Specification 

The  primary  weakness  of  the  APT  is  it's  silence  on  the  number  of  factors  and 
what  the  factors  are.7  In  addition  to  making  the  theory  somewhat  ambiguous,  this 
causes  numerous  challenges  when  designing  tests  of  the  theory.  The  first  challenge  is 
that  one  must  somehow  identify  the  factors.  Connor  (1995)  classifies  multifactor 
models  into  one  of  three  groups  based  on  how  they  tackle  this  identification  problem. 
He  calls  these  groups:  1)  macroeconomic  factor  models,  2)  fundamental  factor 
models,  and  3)  statistical  factor  models.  These  groupings  are  not  entirely  distinct  and 
some  researchers  have  looked  at  models  that  use  two  or  more  of  the  different  types  of 
factors.  For  example,  Fama  and  French  (1993)  combines  elements  of  fundamental 
and  macroeconomic  factor  models  and  Burmeister  and  McElroy  (1988)  use  statistical 
and  macroeconomic  factors.  The  following  sections  will  highlight  the  relative 


6Another  interesting  strength  of  the  APT  is  first  discussed  by  Jarrow  (1988).  He 
argues  that  the  APT  is  based  only  on  ordinal  utility  theory,  rather  than  cardinal  utility 
theory  as  in  the  CAPM.  This  allows  for  systematic  violations  of  the  strong 
independence  axiom  as  some  argue  occurs  in  the  marketplace.  He  suggests  that  the 
most  restrictive  assumption  of  the  APT  is  the  linear  factor  hypothesis. 

7In  fact,  Fama  (1997)  argues  that  determining  the  number  of  factors  in  the  APT 
or  the  ICAPM  is  hopeless! 
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strengths  and  weaknesses  of  each  of  these  types  of  models.  Although  this  research 
will  rely  on  statistical  and  macroeconomic  factor  models,  a  brief  review  will  be  given 
for  each  of  the  three  types. 

2.2.1  Macroeconomic  Factors 

One  way  to  tackle  the  identification  problem  is  to  simply  pre-specify  the 
factors.  Economic  theory  offers  insight  as  to  what  factors  might  systematically  affect 
the  return  on  a  security  and  the  plethora  of  recorded  variables  offer  observable 
proxies  for  these  factors.  The  primary  advantage  of  this  technique  is  that  it  gives  an 
intuitive  appeal  to  the  APT,  as  the  factors  now  have  some  economic  meaning.  It  also 
avoids  some  econometric  problems  associated  with  using  estimated  factors.  Since  we 
actually  observe  the  proxies,  we  can  conduct  individual  Mests  for  whether  or  not  the 
factors  are  priced.  As  we  will  see  in  the  next  chapter,  many  of  the  other  techniques 
do  not  admit  individual  tests  for  significance. 

Unfortunately,  the  use  of  pre-specified  factors  causes  some  unique  econometric 
problems.  First,  each  of  the  proxies  has  to  measure  the  unanticipated  movement  (or 
innovation)  of  the  relevant  factors.  If  one  simply  used  the  underlying  variable, 
without  accounting  for  expected  movement,  it  would  result  in  an  errors-in-variables 
problem,  thus  rendering  the  usual  statistical  tests  useless.  Also,  many 
macroeconomic  variables  are  correlated  and  this  can  cause  other  econometric 
challenges.8  Finally,  most  macroeconomic  data  is  only  available  monthly  thus 
eliminating  the  use  of  weekly  or  daily  returns. 

This  technique  is  also  dangerous  in  that  one  must  arbitrarily  choose  which 
variables  to  use.  Fama  (1991)  points  out  that  these  approaches  offer  much  flexibility 
and  promise,  but  there  is  the  danger  that  the  relations  are  simply  due  to  special 
features  of  a  particular  sample  ("factor  dredging").  For  instance,  it  might  be  that 

Significant  correlation  among  the  factors  can  also  diminish  the  intuitive  appeal  of 
this  technique. 
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during  an  oil  embargo  a  variable  associated  with  oil  prices  is  significant,  yet  in  other 
periods  it  might  not  be. 

Not  only  do  we  have  to  choose  the  macroeconomic  proxies  we  believe  will 
impact  prices,  we  also  have  to  align  the  variables  with  the  stock  return  data.  In  other 
words,  how  do  the  proxies  relate  to  the  factors  in  a  time  series  sense.  Consider  a 
macroeconomic  variable  whose  value  for  the  previous  month  (say  April),  is 
announced  on  the  15th  of  this  month  (May).  Is  the  asset's  return  driven  by  the  new 
information  or  the  actual  underlying  effects  of  the  macroeconomic  factor?  If  it  is 
more  of  an  underlying  relationship,  then  May's  return  will  be  driven  by  May's 
observation  (even  though  it  won't  be  released  until  June).  Then  one  must  decide 
what  to  do  when  the  variable  is  updated  or  the  technique  for  calculating  it  is  revised. 

Finally,  these  types  of  models  do  not  seem  to  perform  as  well  as  other 
multifactor  models  do.  Connor  (1995)  using  an  explanatory  power  approach,  finds 
that  the  fundamental  and  statistical  factor  models  consistently  outperform  the 
macroeconomic  factor  models. 

2.2.2  Fundamental  Factors 

Fundamental  factor  models  use  observed  company  parameters  (e.g.,  book  to 
market  value)  as  factor  betas.  These  methods  then  set  up  mimicking  portfolios  to 
identify  the  extra  return  associated  with  a  unit  exposure  to  each  factor. 

Connor  (1995)  find  that  statistical  and  fundamental  factor  models  significantly 
outperform  macroeconomic  factor  models,  and  that  fundamental  factor  models  slightly 
outperform  the  statistical  factor  models.  Connor  suggests  that  the  fundamental  factor 
models  outperform  the  statistical  factor  models  because  they  include  additional 
information.  The  statistical  factor  models  maximize  explanatory  power  based  solely 
on  returns  data.  Whereas  the  fundamental  factor  models  incorporate  firm  specific 
variables  such  as  book-to-market  value,  firm  size,  and  dividend  yield.  Additional 
information  is  also  often  provided  in  the  form  of  numerous  industry  dummies. 
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Since  fundamental  factor  models  are  not  used  in  this  research,  they  will  not  be 
reviewed  in  the  following  chapter.9  However,  given  the  recent  visibility  of  these 
types  of  models,  it  is  worthwhile  to  briefly  discuss  them.  Fama  and  French  (1992, 
1993)  develop  a  three  factor  model  consisting  of  size,  book-to-market  value,  and  a 
broad  market  index.  They  suggest  that  their  model  works  because  the  fundamental, 
firm  characteristics  proxy  for  two  unknown,  underlying  risk  factors.  As  such, 
fundamental  factors  may  play  an  important  role  in  searching  for  answers  about  the 
APT.  If  we  can  find  firm  characteristics  that  seem  to  proxy  for  factors,  these 
characteristics  might  offer  clues  as  to  the  nature  of  the  underlying  factors. 

Specifically,  they  might  lead  us  to  those  macroeconomic  variables  that  proxy  for  the 
factors. 

Unfortunately,  the  evidence  on  Fama  and  French’s  model  is  mixed.  Kim 
(1995)  suggests  that  Fama  and  French's  results  are  due  to  an  errors  in  variable 
problem.  In  a  recent  paper,  Daniel  and  Titman  (1997)  argue  that  there  is  no  risk 
factor  associated  with  high  or  low  book-to-market  firms.  They  also  argue  that  there 
is  no  return  premium  associated  with  any  of  Fama  and  French's  (1993)  factors.  They 
assert  that  the  strong  covariances  among  firms  with  high  book-to-market  value  are 
due  to  similar  firm  characteristics  (e.g.,  same  industries)  and  not  common  exposure  to 
a  priced  risk  factor.  Knez  and  Ready  (1997)  argue  that  Fama  and  French's  result  on 
size  is  due  to  a  few  extreme  outliers. 

2.2.3  Statistical  Factors 

Statistical  factor  models  use  various  statistical  technique  to  extract  the  factors 
from  cross-sectional  and  time  series  samples  of  security  returns.  The  primary 
strength  of  these  techniques  is  that  one  does  not  have  to  make  arbitrary  choices  about 
what  variables  to  include.  The  data  itself  determines  the  factors.  There  are 

9Examples  of  empirical  papers  that  use  a  fundamental  factor  model  include 
Rosenberg  (1974),  and  Beckers,  Grinold,  Rudd,  and  Stefek  (1992). 
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numerous  techniques  for  "extracting"  the  factors  from  the  data,  and  each  of  these 
techniques  has  its  associated  strengths  and  weaknesses.  This  review  will  focus  on 
two  major  techniques.  The  first  approach,  used  in  many  early  papers,  involves  some 
form  of  maximum-likelihood  factor  analysis  to  extract  the  factors  and  their  loadings 
from  the  sample  covariance  matrix  of  asset  returns.  The  second  approach  involves  an 
asymptotic  principal  components  technique,  due  to  Connor  and  Korajczyk  (1986, 

1988)  that  is  similar  to  factor  analysis  but  allows  one  to  use  large  numbers  of 
securities.  These  techniques  will  be  reviewed  in  detail  in  the  following  chapter. 

2.3  Theoretical  Extensions  of  the  APT 
As  with  the  CAPM,  it  was  not  long  before  academicians  began  questioning  the 
validity  of  the  APT.  Most  of  these  questions  revolved  around  the  approximate  nature 
of  the  pricing  relation.  Shanken  (1982)  points  out  that  equation  (2.9)  is  an 
approximate  pricing  relationship,  not  an  equilibrium  model  that  must  be  satisfied  by 
all  assets.10  Consequently,  it  should  price  most  assets  with  negligible  error,  however, 
it  allows  for  arbitrarily  large  deviations  for  a  small  number  of  assets  in  any  finite 
economy.  Not  surprisingly,  many  researchers  began  to  look  at  the  magnitude  of  the 
pricing  error  and  offered  alternative  derivations  based  on  differing  assumptions.  A 
comprehensive  bibliography  of  these  early  theoretical  contributions  can  be  found  in 
Dybvig  and  Ross  (1985).  Some  of  the  key  results  are  discussed  in  the  following 
paragraphs. 


10Actually,  the  static  APT  and  CAPM  are  not  true  "pricing"  models.  By  pricing 
models  in  this  context  we  are  talking  about  the  ability  to  determine  expected  returns. 
An  example  of  a  true  "pricing"  model,  one  that  actually  can  price  securities,  is  the 
intertemporal  version  of  the  APT  as  discussed  in  Roll  and  Ross  (1980),  or  the  more 
general  model  of  Cox,  Ingersoll,  and  Ross  (1985). 
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2.3.1  Approximate  Factor  Structure 

Chamberland  and  Rothschild  propose  an  approximate  factor  structure  that  is  a 
much  weaker  assumption  than  Ross's  strict  factor  structure.  As  mentioned  earlier,  a 
strict  factor  structure  implies  that  the  idiosyncratic  components  of  asset  returns  have 
zero  correlation  across  assets.  Connor  and  Korajczyk  (1993)  offer  an  intuitive 
example  of  what  a  strict  factor  structure  implies.  One  might  easily  agree  that 
awarding  a  large  defense  contract  to  one  aerospace  firm  might  affect  several 
aerospace  firms.  A  strict  factor  structure  would  require  this  industry-specific  risk  to 
be  a  universal  factor.  Chamberland  and  Rothschild  point  out  that  it  is  unlikely  any 
large  group  of  securities  will  have  a  usefully  small  number  of  factors  given  a  strict 
factor  structure.  In  an  approximate  factor  structure,  the  idiosyncratic  risks  can  be 
correlated  and  hence  the  idiosyncratic  covariance  matrix  need  not  be  diagonal.  So, 
industry-specific  uncertainty,  for  example,  does  not  have  to  be  a  pervasive  factor. 
Instead,  it  will  just  show  up  as  an  off-diagonal  term  in  the  covariance  matrix  of  the 
idiosyncratic  risks. 

Chamberland  and  Rothschild  (1983)  show  that  if  the  covariance  matrix  of  the 
asset  returns  has  k  unbounded  eigenvalues,  then  an  approximate  factor  structure  exists 
and  it  is  unique.  They  also  show  that  as  the  number  of  securities  grow,  the  k 
eigenvectors  associated  with  the  k  unbounded  eigenvalues  will  asymptotically 
converge  and  play  the  role  of  the  factor  loadings.  Connor  (1984)  and  Ingersoll 
(1984)  also  weaken  Ross’s  original  assumptions  and  derive  results  similar  to  those  of 
Chamberland  and  Rothschild.  Grinblatt  and  Titman  (1985)  verify  the  Chamberlain 
and  Rothschild  results  and  show  that  any  economy  that  satisfies  an  approximate  factor 
structure  can  be  transformed,  without  altering  the  portfolios  investors  hold,  to  an 
equivalent  exact  factor  structure. 
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2.3.2  Pricing  Bounds 

Ross  (1976),  in  his  original  development  of  the  APT,  showed  that  the  sum  of 
squared  approximation  errors  in  equation  (2.9)  is  finite  as  the  number  of  securities 
approach  infinity.  Subsequently,  Huberman  (1982)  looks  at  the  no  arbitrage  condition 
and  provides  conditions  under  which  an  economy  will  allow  idiosyncratic  risk  to  be 
completely  diversified  away.  Chamberlain  (1983)  provides  a  pricing  bound  for  the 
approximate  factor  structure  model.  Connor  (1984)  shows  that  the  sum,  across 
assets,  of  squared  deviations  goes  to  zero  as  assets  are  added.  Shanken  (1992)  shows 
that  a  finite  pricing  bound  holds  when  the  factors  are  replaced  by  proxies  that  are 
sufficiently  close  to  the  true  factors.  Each  of  these  papers  makes  additional 
assumptions,  but  none  of  the  assumptions  are  overly  restrictive.  For  example,  many 
assume  that  there  are  portfolios  that  mimic  the  factors,  that  some  agent  holds  a  well 
diversified  portfolio,  that  there  are  many  assets,  and  that  all  assets  are  in  positive 
supply. 

Whereas  the  preceding  papers  all  developed  bounds  for  pricing  errors  among 
all  assets,  Craig  and  Malkiel  (1982)  give  an  intuitive  explanation  as  to  why  the 
pricing  error  for  individual  assets  should  be  small.  Connor  (1984),  Dybvig  (1983), 
and  Grinblatt  and  Titman  (1983)  actually  derive  specific  expressions  for  the  pricing 
error  for  individual  assets.  Connor,  and  Grinblatt  and  Titman  assume  investors  are 
risk  averse  and  returns  are  distributed  multivariate  normal.  Dybvig,  assumes  the 
market  portfolio  is  efficient.  Again,  these  derivations  involve  additional  assumptions, 
but  the  assumptions  are  still  somewhat  appealing,  and  the  results  show  that  relative  to 
the  measurement  error  in  expected  returns  the  pricing  error  is  very  small. 

So  far,  every  theoretical  version  of  the  APT  has  assumed  homogeneous  beliefs 
regarding  the  expectations  generating  process.  Connor  and  Korajczyk  (1986)  update 
their  equilibrium  version  of  the  APT  to  allow  an  infinitesimal  number  of  informed 
investors.  Handa  (1991)  shows  that  with  information  uncertainty  the  APT  still  holds 
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as  an  approximate  pricing  relationship.  Further,  he  shows  that  the  pricing  error  will 
be  a  decreasing  function  of  the  information  available  and  an  increasing  function  of 
firm  size  and  idiosyncratic  risk.  This  extends  earlier  work  by  Dybvig  (1983)  and 
Grinblatt  and  Titman  (1983)  who,  assuming  homogeneous  beliefs,  find  that  the  upper 
bound  is  an  increasing  function  of  size. 

2.3.3  Equilibrium  Versions  of  the  APT 

Carrying  the  theory  even  one  step  further,  several  papers  have  developed  an 
equilibrium  version  of  the  APT,  where  equation  (2.9)  becomes  an  equality.  Chen 
and  Ingersoll  (1983)  suggest  the  APT  will  price  all  assets  correctly  if  there  exists  a 
portfolio  whose  return  has  no  idiosyncratic  risk  and  some  expected  utility  maximizer 
(with  a  continuously  differentiable,  increasing,  and  strictly  concave  utility  function) 
finds  that  portfolio  to  be  optimal.  Connor  (1984),  Chamberlain  (1983),  and  Grinblatt 
and  Titman  (1983)  develop  equilibrium  models  based  on  the  existence  of  a  well- 
diversified  portfolio  on  the  mean- variance  frontier  and  by  placing  restrictions,  such  as 
risk  aversion,  on  the  investor's  utility  functions. 

Lee,  Park,  and  Wei  (1993)  develop  an  equilibrium  APT  in  the  same  spirit  as 
Shanken's  (1985)  theory  and  the  unified  APT/CAPM  of  Wei  (1988).  Essentially 
these  models  show  that  even  without  assuming  perfect  diversity  in  the  market 
portfolio,  as  done  by  the  above  papers,  the  market  portfolio  will  enter  the  pricing 
relation  as  a  priced  risk.  Chamberlain  (1988)  develops  an  intertemporal  equilibrium 
model  that  links  the  APT  with  Merton's  (1973)  Intertemporal  Capital  Asset  Pricing 
Model  (ICAPM).  While  these  equilibrium  versions  of  the  APT  give  an  exact 
equilibrium,  expected  return  relationship,  they  also  open  the  door  to  a  joint 
hypothesis  problem  similar  in  spirit  to  tests  of  the  CAPM. 

Finally,  Handa  and  Linn  (1991)  develop  an  equilibrium  APT  in  an  economy 
where  investors  have  heterogeneous  beliefs  and  differing  information  sets.  In  a 
following  paper,  Handa  and  Linn  (1993)  suggest  that  betas  estimated  with  past  returns 
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are  complete  information  betas  and  ignore  the  fact  that  bayesian  investors  will  account 
for  estimation  risk  (i.e.,  they  do  not  ignore  variation  of  information  across  assets). 
They  argue  that  when  using  complete  information  betas,  average  return  deviation  will 
be  correlated  with  any  information  proxy  (e.g.,  size),  and  as  the  number  of  securities 
are  increased,  so  will  the  number  of  factors.  This  finding  has  obvious  implications 
on  the  anomalies  literature. 


2.4  Testability  of  the  APT 

Since  the  focus  of  this  dissertation  is  on  the  viability  of  the  APT,  it  is 
important  to  address  the  theory  relating  to  the  APT's  testability.  Shanken  (1982) 
presents  the  first  primary  challenge  to  the  testability  of  the  APT.  Shanken  has  two 
major  contentions.  First,  as  previously  discussed,  the  APT  is  developed  in  an  infinite 
economy,  yet  empirical  tests,  by  definition,  take  place  with  a  finite  number  of  assets. 
Second,  Shanken  suggests  that  one  can  fundamentally  alter  the  nature  of  the 
underlying  factor  structure  by  simply  rearranging  the  underlying  securities  into 
equivalent  portfolios. 

Dybvig  and  Ross  (1985)  argue  that  Shanken' s  contentions  are  groundless. 

They  suggest  that  the  APT  is  robust  to  linear  transformations  and  that  the  factors 
should  remain  stable.  Grinblatt  and  Titman  (1985)  show  that  Shanken's  argument 
about  repackaging  securities  to  hide  factors  is  invalid,  because  the  variance  of  the 
repackaged  security  will  approach  infinity  and  be  correctly  identified  as  a  factor. 
Shanken  (1985)  however,  maintains  that  the  APT  is  not  testable  and  that  most 
researchers  test  predictions  based  on  the  relatively  restrictive  equilibrium  APT,  yet 
use  the  assumptions  of  the  less  restrictive  approximate  APT.  Shanken  argues  the 
equilibrium  versions  are  not  testable  because  they,  like  the  CAPM,  involve  the 
market  portfolio. 
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Grinblatt  and  Titman  (1987)  suggest  that  there  is  a  big  difference  between  the 
global  mean-variance  efficiency  of  the  market  portfolio  in  the  CAPM  and  the  APT 
saying  that  for  "subsets"  of  the  economy,  the  proxy  portfolio  is  locally  mean- variance 
efficient  relative  to  the  assumed  factor  structure.  They  suggest  that  with  techniques 
allowing  for  large  numbers  of  securities  and  factors  to  be  analyzed  it  is  possible  to 
construct  a  portfolio  whose  idiosyncratic  risk  is  virtually  eliminated. 

More  recently,  Shanken  (1992)  and  Reisman  (1992)  suggest  that  the 
approximate  APT  relationship  is  a  tautology  for  any  finite  set  of  assets.  If  the  pricing 
bound  is  known,  then  there  is  nothing  to  test.  This  is  because  the  finite  bound  on  the 
pricing  error  will  absorb  any  error  due  to  misspecification  of  the  factors. 

2.5  Summary 

In  summary,  the  theoretical  work  on  the  APT  seems  to  indicate  that  it  should 
hold,  either  approximately  or  in  equilibrium  under  varying  assumptions.  The 
assumptions  are  not  overly  restrictive,  especially  for  traded  assets,  and  since  the  APT 
is  valid  on  subsets,  one  can  reasonably  expect  the  APT  to  hold.  The  following 
chapter  looks  at  the  empirical  literature  concerning  the  APT. 


CHAPTER  3 

REVIEW  OF  THE  LITERATURE 


Whereas  the  previous  chapter  provided  a  brief  theoretical  review  of  the  APT 
and  its  early  development,  this  chapter  focuses  on  the  empirical  literature  surrounding 
the  APT.  Where  necessary,  theoretical  papers  will  be  discussed  as  they  apply  to  new 
methodologies.  The  purpose  of  this  chapter  is  not  to  provide  a  comprehensive  review 
of  every  paper  that  has  tested  the  APT.  Rather,  we  want  to  highlight  those  papers 
that  have  made  a  significant  development  or  will  be  directly  applicable  to  the  research 
at  hand.  Specifically,  we  concentrate  on  those  papers  that  look  at  determining  the 
number  of  priced  factors  and  the  stability  of  these  factors  and  prices.  Connor  and 
Korajczyk  (1995)  provide  a  thorough  review  of  the  APT  and  the  empirical  literature. 
As  mentioned  in  the  previous  chapter,  we  focus  on  those  papers  that  have  used 
Connor  and  Korajczyks's  asymptotic  principal  component  technique  or  pre-specified 
macroeconomic  variables.  Other  techniques  are  discussed  as  necessary.  Chapter  four 
will  review  specific  methodology  and  testing  as  necessary. 

3.1  Testing  the  APT 

Before  looking  at  the  specific  implementations  and  results  of  various  tests  of 
the  APT,  it  is  worthwhile  to  look  at  the  task  at  hand.  A  theory  can  generally  be 
tested  by  examining  its  assumptions  or  its  implications.  This  research  looks  at  both 
types  of  tests.  First,  regarding  the  assumptions,  the  APT's  primary  assumption  is 
that  returns  are  generated  by  a  linear  k  factor  function  (equation  (2.2)  in  the  previous 
chapter). 
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r{  =  E  +  Bft  +  e,  <31> 

The  other  assumptions  will  be  different,  depending  on  which  version  of  the  APT  one 
is  investigating  (e.g.,  strict  versus  approximate  factor  structure).  Clearly,  if  one  can 
demonstrate  that  returns  are  not  generated  by  k  pervasive  factors,  the  other 
assumptions  do  not  matter,  as  the  APT  will  not  hold. 

The  primary  implication  of  the  APT  is  it's  pricing  relationship  (equation 
(2.11)  in  the  previous  chapter), 

E  =  \  +  KB  (3.2) 

Tests  of  this  implication  have  generally  focused  on:  1)  how  many  factors  are  priced, 
2)  is  the  intercept  term  equal  across  groups,  and  3)  are  any  other  variables  priced?  In 
addition,  numerous  papers  have  looked  at  how  well  the  APT  performs  relative  to  the 
CAPM  in  either  explaining  the  cross-section  of  returns  or  explaining  the  various 
pricing  anomalies. 

The  t  subscript  has  been  added  to  equation  (3.1)  to  illustrate  that  for  most 
empirical  analysis  we  will  work  with  a  time-series  of  t  return  observations  (i.e.,  t  = 
1,  ...,  T).11  We  also  will  use  an  equilibrium  version  so  that  a  strict  equality  holds. 
From  these  two  equations,  we  notice  that  one  must  estimate  all  of  the  parameters 
using  only  the  observed  returns,  since  none  of  the  other  parameters  are  known.  Most 
of  the  empirical  procedures  are  based  on  a  two-step  procedure,  similar  to  that  which 
Fama  and  MacBeth  (1973)  used  in  their  seminal  analysis  of  the  CAPM.  In  the  first 
step  we  use  time-series  data  to  estimate  the  factors  (f)  and  their  beta's  (B).  If  the 
factors  are  known  a-priori,  or  proxies  have  been  specified  for  them,  as  in  the  case  of 
macroeconomic  variables,  the  task  is  somewhat  straightforward.  Without  knowing 
the  factors  we  must  extract  the  two  terms  (f  and  B)  simultaneously,  which  is 


uAs  will  be  discussed  later,  some  empirical  tests  also  allow  for  time  varying  risk 
premia  and  beta's. 
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complicated  by  the  fact  that  they  enter  into  equation  (3.1)  multiplicatively.  In  the 
second  step,  we  use  cross-sectional  regressions  to  estimate  the  factor's  associated 
prices  (k).  Factor  analysis  was  the  first  technique  used  to  extract  the  factors  in  the 
first  of  these  two  steps. 


3.2  Factor  Analysis 

Although  this  effort  will  be  relying  primarily  on  Connor  and  Koraczyk's 
(1986,  1988)  asymptotic  principal  component  technique,  it  is  useful  to  first  review  the 
technique  of  factor  analysis.  This  technique  is  used  in  many  of  the  early  empirical 
tests  of  the  APT  and  the  various  other  techniques  were  designed  to  overcome  some  of 
the  problems  inherent  in  the  factor  analysis  approach. 

3.2.1  Theoretical  Foundations 

Roll  and  Ross  (1980)  is  the  seminal  empirical  work  in  this  area.  This  section 
follows  their  theoretical  development.12  Roll  and  Ross  start  with  Ross’s  original  strict 

factor  model,  as  given  by  equations  (3.1)  and  (3.2),  where  E[et  et  ]  =  a 

diagonal  matrix  of  own  asset,  idiosyncratic  variances.  They  further  assume  that  all  of 

the  relevant  factors  are  accounted  for,  so  that  E[et  |  /(]  =  0.  With  this  relationship. 

Roll  and  Ross  show  that  the  population  covariance  matrix,  V  = 

E[(r(  -  E)  (rt  -  £);]  can  be  decomposed  as  follows: 

V  =  BLB'  +  Ed  (3.3) 

where  B  is  the  matrix  of  factor  loadings,  L  is  the  matrix  of  factor  covariances,  and 
ZD  is  as  defined  above.  In  essence,  we  have  decomposed  the  -overall  covariance  into 
systematic  and  non-systematic  components.  It  is  important  to  note  that  this 
specification  is  not  unique.  Consider  any  orthogonal  matrix  G  (i.e.,  GG'  =  I). 


12The  notation  has  been  changed  for  consistency  in  this  paper. 
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When  one  estimates  B  from  the  sample  covariance  matrix  v,  all  linear 
transformations  BG  will  be  equivalent.  This  can  easily  be  seen  from  equation  (3.3), 

V  =  (BG)(G/  AG){BG)'  +  SD 

=  BGG'AGG1  B'  +  VD  (3.4) 

=  BAB1  + 

This  problem  makes  comparison  among  factors,  or  the  pricing  of  individual  factors, 
almost  impossible.  For  example,  factors  1  and  2  could  switch  place  or  we  could 
scale  up  factor  j's  loadings  by  a  scalar  g  and  scale  down  factor  j  by  the  same  scalar 
g,  and  still  obtain  the  same  return  relationship.  This  scaling,  however  would  also 
have  an  influence  on  the  price  of  factor  j. 

Roll  and  Ross  suggest  a  maximum-likelihood  approach  to  estimate  B.  Roll 
and  Ross  also  assume  the  factors  are  orthogonal  and  scaled  to  have  unit  variance. 
Therefore  L  becomes  the  identity  matrix.  This  condition  can  always  be  assured 
through  scaling  and  rotating  the  factors. 

After  estimating  the  B  matrix  in  this  manner,  the  second  step  involves 
estimating  the  X’s.  By  substituting  equation  (3.2)  into  equation  (3.1)  we  obtain 

A  =  f,  ~  K  =  B<x  +  ~et  (3  5) 

=  Bk  +  Bft  +  e{ 

Roll  and  Ross  then  note  that  the  cross-sectional  regressions  will  be  biased  by  the 
time-series  sample  mean  of  the  factors.  As  the  sample  size  n  increases,  the  bias 
should  approach  zero.  Roll  and  Ross  suggest  the  use  of  a  generalized  least  squares 
(GLS)  cross-sectional  regression  for  each  day  t,  where: 

(• K  +ft)  =  (b'v  'by1  b'v  V, 


(3.6) 
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and  V  is  the  estimated  covariance  matrix  of  Bf(  +  ef, which  is  the  disturbance  term  in 
equation  (3.5).  Then  the  time-series  average  of  the  Xt's  will  yield  an  unbiased 

estimate  of  X  since  the  expected  value  of  ft  is  zero.  The  covariance  matrix  of  A  will 
be 

BV^B  (3.7) 

which,  by  design,  will  be  diagonal.  This  means  the  estimated  risk  premia  are 
independent. 

One  problem  with  the  Roll  and  Ross  approach  is  the  computational  burden 
associated  with  factor  analysis  of  large  matrices.  For  this  reason  Roll  and  Ross  break 
their  sample  into  groups  of  30  securities  and  use  factor  analysis  individually  on  each 
group.  They  find  at  least  three,  and  possibly  four,  priced  factors  based  on  individual 
Mests  of  significance.  Roll  and  Ross  also  provide  weak  evidence  that  own-variance  is 
not  priced  and  that  the  intercept  term  is  constant  in  the  cross-sectional  regressions. 
Both  of  these  findings  lend  additional  support  to  the  APT. 

3.2.2  Early  Criticisms 

Dhrymes,  Friend,  and  Gultekin  (DFG)  (1984)  offer  three  points  of  criticism 
concerning  the  Roll  and  Ross  approach.13  It  is  important  to  mention  these  criticisms 
since  they  have  driven  the  development  of  many  of  the  other  techniques.  They  also 
cast  a  new  light  on  interpreting  results  from  these  early  papers.  First,  DFG  point  out 
that  due  to  the  linear  transformation  problem,  individual  Mests  are  not  valid.  The 
only  statistically  sound  test  available  is  a  joint  test  (an  F-test)  of  whether  or  not  all  the 
factors  together  are  significant.  Next,  they  show  that  factor  analysis  on  individual 

13Roll  and  Ross  (1984)  take  exception  to  many  of  DFG's  criticisms.  In  a 
subsequent  paper,  however,  Dhrymes,  Friend,  Gultekin,  and  Gultekin  (1985)  provide 
further  evidence  that  factor  analysis  has  serious  flaws,  and  that  the  number  of  factors 
will  increase  with  the  number  of  securities. 
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groups  is  not  equivalent  to  factor  analysis  on  the  entire  sample.  This  is  because  of 
the  lost  off-diagonal  covariance  information.  They  also  point  out  that  each  individual 
group  may  have  different  factors.  They  suggest  that  one  needs  to  yield  to  the  law  of 
large  numbers  and  use  the  entire  sample.14  Finally,  they  empirically  demonstrate  that 
with  Roll  and  Ross's  technique  the  number  of  significant  factors  will  increase  as  the 
number  of  securities  in  the  group  is  increased. 

Overall,  they  think  that  there  are  more  than  five  factors.  They  also  point  out 
that  the  most  dominant  factor's  loadings  remain  somewhat  stable  as  individual 
securities  are  added  or  deleted  from  a  group;  however  the  loadings  for  the  other 
factors  drastically  change.  Therefore  it  is  difficult  to  make  any  inferences  from  the 
estimated  loadings.  This  is  also  the  first  evidence  that  the  nondominant  factors  (i.e., 
those  factors  other  than  the  first),  might  be  unique  to  the  sample  at  hancf  and  not 
indicative  of  an  economy  wide,  pervasive  factor. 

3.2.3  Early  Empirical  Results 

Numerous  other  researchers  used  the  basic  Roll  and  Ross  methodology  and  at 
this  stage,  the  evidence  on  the  APT  is  mixed.  This  section  will  highlight  some  of 
these  results  and  look  at  some  extensions  and  modifications  to  the  method  of  factor 
analysis. 

Chen  (1980)  demonstrates  that  if  you  know  the  factor  loadings  for  k 
portfolios,  you  can  compute  k  factor  loadings  for  any  asset,  say  p,  if  you  know  the 
pth  asset's  covariance  with  the  other  k  portfolios.  This  diminishes  the  computational 
burden  of  factor  analysis;  however,  it  forces  a  common  factor  model  (which  is 
estimated  with  a  small  number  of  securities)  on  all  securities.  In  a  subsequent  paper, 
Chen  (1983)  compares  the  performance  of  the  CAPM  to  the  APT.  He  finds  that  the 
APT  does  a  better  job  of  explaining  the  cross-section  of  expected  returns.  Chen  also 


14Unfortunately,  computational  limitations  prohibit  Roll  and  Ross  from  doing  so. 


26 

shows  that  own  variance  and  firm  size  have  no  explanatory  power.  Reinganum 
(1981),  however,  uses  the  same  methodology,  and  finds  that  the  APT  is  not  able  to 
remove  the  size  anomaly. 

Brown  and  Weinstein  (1983),  in  a  paper  that  might  be  considered  the  most 
closely  related  to  this  effort,  propose  a  new  approach  to  testing  the  APT.15  Brown 
and  Weinstein  propose  a  test  similar  in  spirit  to  that  of  Gibbons  (1982)  multivariate 
test  of  the  CAPM.  They  call  their  test  the  bilinear  hypothesis  and  argue  that  it  is 
applicable  to  all  asset  pricing  models.  Basically,  they  make  a  crude  attempt  to 
compare  factors  among  different  groups.  After  adjusting  their  significance  levels 
downward,  due  to  a  large  sample  size,  they  find  evidence  in  support  of  a  three  factor 
APT,  and  reject  a  five  and  seven  factor  version. 

Brown  and  Weinstein  (1983)  compare  the  factor  structures  across  groups  with 
an  F-test  that  is  based  on  the  unconstrained  and  constrained  error  variances. 
Specifically,  the  statistic  is  based  on  the  difference  between  the  unconstrained  residual 
sum  of  squares  from  factor  analysis  of  two  groups  of  30  securities  and  the 
constrained  residual  sum  of  squares  obtained  from  extracting  the  factors  from  the 
entire  group  of  60  securities.  They  find  that  more  than  three  factors  are  necessary  to 
explain  a  reasonable  amount  of  the  observed  variation,  but  that  the  three  factors  that 
best  explain  the  variation  are  the  same  across  groups. 

Brown  and  Weinstein  claim  this  test  directly  compares  the  factors.  In  actuality 
it  is  a  joint  test  that  compares  the  predictive  ability  of  the  entire  set  of  factors  as  a 
whole.  If  one  of  the  factors  explained  a  large  portion  of  the  variance  their  test  might 
still  say  there  were  three  (or  more)  common  factors.  In  contrast,  the  test  I  propose 
allows  for  a  more  direct  comparison  of  the  factors,  not  their  joint  ability  to  explain 
the  cross-sectional  variation  of  returns. 


15This  paper  will  be  discussed  in  more  detail  in  the  following  chapter. 
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Cho  (1984)  tackles  the  issue  of  factor  comparability  by  employing  inter-battery 
factor  analysis.  This  technique  extracts  the  factors  from  the  inter-group  sample 
covariance  matrix  instead  of  the  full  sample  covariance  matrix.  This  technique  forces 
common  factors  among  the  groups  but  ignores  a  large  amount  of  information  in  the 
overall  covariance  matrix.  Cho  groups  the  securities  based  on  their  SIC  code  and 
finds  five  or  six  common  factors  that  appear  to  be  responsible  for  generating  daily 
security  returns.  Interestingly  the  number  of  factors  does  not  appear  to  be  related  to 
the  size  of  the  groups.  Specifically,  Cho  divides  his  data  into  22  groups  and  then 
looks  at  the  number  of  factors  common  to  each  pair-wise  set  of  adjacent  groups.  The 
results  indicate  that  the  number  of  factors  that  are  common  between  groups  appears  to 
be  stable.  Cho  suggests  there  are  approximately  six  factors.  However,  there  is  no 
guarantee  that  the  factors  are  the  same  (e.g.,  the  six  factors  common  to  groups  one 
and  two  may  be  entirely  different  from  the  six  factors  he  finds  common  to  groups 
seven  and  eight). 

Kryzanowski  and  To  (1983)  compare  various  methods  of  factor  analysis  and 
various  methods  for  determining  the  number  of  significant  factors.  After  extracting 
50  factors  from  various  portfolios  they  conclude  that  standard  likelihood  tests  would 
suggest  ten  or  more  factors,  but  more  stringent  tests  suggest  there  are  less  than  five. 
They  also  make  the  important  observation  that  while  the  first  factor  was  associated 
with  a  large  percentage  of  the  securities,  the  remaining  factors  were  not.  Again,  this 
appears  to  be  evidence  that  the  non  dominant  factors  may  be  sample  specific. 

Other  researchers  have  used  factor  analysis  to  examine  returns  from  different 
markets.  For  example  Beenstock  and  Chan  (1986)  look  at  the  UK  stock  market. 

They  find  that  a  relatively  high  proportion  of  estimated  expected  returns  is  explained 
by  the  APT.  They  find  weak  evidence  that  more  than  20  factors  are  significant. 
Beenstock  and  Chan  find  that  the  APT  does  subsume  the  size  effect,  but  own 
variances  are  priced.  Abeysekera  and  Mahajan  (1987)  also  look  at  the  UK  market 
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and  find  mixed  results.  They  find  that  the  intercept  terms  are  equal  across  groups, 
but  the  risk  premia  are  jointly,  insignificantly  different  from  zero. 

Gultekin  and  Gultekin  (1987)  show  that  empirical  tests  of  the  APT  are  very 
sensitive  to  the  January  effect.  Basically  they  argue  that  the  APT  can  explain  the  risk 
return  relationship  only  in  the  month  of  January.16  Cho  and  Taylor  (1987),  partially 
in  an  attempt  to  explain  this  finding  by  Gultekin  and  Gultekin,  examine  the  month  by 
month  stability  of  the  APT.  Specifically  they  look  at  the  stability  of:  1)  daily 
returns,  2)  the  covariance  and  correlation  matrices,  3)  the  number  of  factors,  and  4) 
the  APT  pricing  relationship.  Unfortunately,  as  they  admit,  many  of  their  tests  are 
very  weak  due  to  the  non-uniqueness  of  factor  loadings.  Overall,  they  reject  the  APT 
and  find  that  January  does  have  a  different  pricing  behavior. 

Lehman  and  Modest  (1988)  develop  a  factor  analysis  technique  for  large 
samples  that  they  claim  is  computationally  more  efficient  than  principle  components. 
They  find  that  after  adjusting  for  the  APT  risk  factors,  dividend-yield  and  own 
variance  play  no  role  in  estimating  expected  returns,  whereas  these  variables  have 
been  shown  to  be  related  to  excess  returns  in  a  CAPM  model.  They  do  not  find, 
however,  that  the  APT  is  able  to  account  for  the  size  effect.17  Lehmann  and  Modest 
stress  the  importance  of  testing  the  expected  return  relationship  with  "out  of  sample" 
data  (i.e.,  security  returns  that  were  not  used  to  estimate  the  factor  model).  They 
argue  that  factor  analysis  has  a  tendency  to  over  fit  the  data  and  therefore  these  "out 


16Tinic  and  West  (1984)  document  the  same  finding  for  the  CAPM.  The  January 
effect  is  linked  with  many  of  the  other  anomalies,  in  particular,  most  or  all  of  the  size 
effect  has  been  shown  to  occur  in  January. 

17Most  of  the  mispricing  is  concentrated  in  the  smallest  and  largest  firms,  so  they 
suggest  that  APT  does  fit  most  listed  firms  with  little  error. 
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of  sample"  tests  are  important.18  This  research  will  father  support  this  hypothesis  by 
demonstrating  that  many  of  the  apparently  significant  factors  are  unique  to  a 
particular  sample. 

In  a  similar  vein,  Conway  and  Reinganum  (1988)  use  the  cross-validation  to 
test  for  a  stable  factor  structure.  Cross-validation  involves  fitting  a  model  with  one 
set  of  data  and  then  seeing  how  well  it  predicts  out  of  sample.  If  a  factor  is  economy 
wide,  or  pervasive,  it's  inclusion  will  cause  a  decline  in  the  out  of  sample  prediction 
error.  If  the  model  is  over-fit  and  the  additional  factor  is  sample  specific  the  out  of 
sample  prediction  error  will  remain  the  same  or  even  increase.  Conway  and 
Reinganum  are  perhaps  the  first  to  clearly  document  that  factors  extracted  from 
randomly  selected  securities  may  be  specific  to  that  particular  sample.  They  find  that 
common  likelihood  ratio  statistics  identify  four  or  more  factors  whereas  cross- 
validation  suggests  only  one  or  two  of  these  factors  are  pervasive.  Conway  and 
Reinganum  base  the  adequacy  of  their  technique  on  very  limited  simulation  data. 

They  look  at  the  ability  of  their  cross-validation  technique  to  distinguish  between 
pervasive  and  sample  specific  factors  in  one-factor  and  two-factor  models.  Their 
results  may  also  be  sensitive  to  the  rotational  indeterminacy  of  extracted  factors. 

Jobson  (1988)  proposes  two  indexes  for  goodness  of  fit  that  can  be  used  in 
place  of  cross-validation.  This  allows  the  researcher  to  use  more  of  the  sample  in 
estimating  the  model.  The  indexes  are  CA  (criterion  of  Akaike),  developed  by 
Akaike  (1974)  and  CS  (criterion  of  Schwartz),  developed  by  Schwartz  (1978).  Both 
the  CA  and  CS  index  are  penalty  functions  that  penalize  the  likelihood  statistic  as 
more  parameters  are  added.  Jobson  demonstrates  that  by  using  the  CS  index  the 
number  of  factors  found  are  similar  to  the  results  of  Conway  and  Reinganum  (1988). 


18Bower,  Bower,  and  Logue  (1984)  showed  that  the  APT  performed  better  than 
the  CAPM  at  explaining  out  of  sample  returns.  However,  their  entire  analysis  was 
done  using  only  utility  stocks,  and  this  will  favor  the  APT. 
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Stambaugh  (1988)  argues  that  Conway  and  Reinganum  (1988)  based  the 
adequacy  of  their  technique  on  very  limited  simulation  data.  He  suggests  that  more 
work  needs  to  be  done  using  simulated  data  to  test  the  techniques  based  on  various 
factor  structures.  Brown  (1988)  also  questions  the  Conway  and  Reinganum  (1988) 
study.  Brown  shows  that  if  an  economy  has  five  equally  important  factors,  factor 
analysis  will  identify  one  dominant  factor  (that  will  continue  to  grow  with  the  number 
of  securities)  and  four  significant,  but  minor  factors.19 

3.2.4  Summary 

As  the  above  discussion  illustrates,  the  early  evidence  on  the  APT  is  somewhat 
mixed.  Clearly,  it  seems  as  if  the  APT  does  do  better  than  the  CAPM  at  explaining 
the  cross-section  of  expected  returns  and  it  seems  to  account  for  some  of  the 
anomalies  associated  with  the  CAPM.  Determining  the  number  of  priced  factors 
seems  to  be  the  biggest  obstacle.  In  fact,  at  this  stage  it  appears  to  be  more  of  an  art 
than  an  exact  science.  With  some  exceptions,  there  appears  to  be  a  disturbing  trend 
that  as  the  sample  size  increases,  so  does  the  number  of  factors.  It  also  appears  that 
the  first  factor  is  somewhat  dominant  and  the  other  factors  may  or  may  not  be 
pervasive.  Although,  this  result  might  be  due  to  the  rotational  indeterminacy  of 
factor  analysis.  It  seems  that  simulation  could  easily  be  used  to  validate  the  ability  of 
various  techniques  to  accurately  identify  "priced"  factors  under  varying  factor 
structures.  This  would  go  a  long  way  in  helping  the  researcher  understand  the 
implications  of  the  rotational  indeterminacy. 

As  discussed  in  Chapter  2,  the  APT  is  valid  on  subsets,  but  since  we  are 
dealing  with  an  equilibrium  version  of  the  APT,  we  need  a  large  sample  to  diversify 


19Although  Brown’s  (1988)  work  sheds  much  light  on  the  empirical  observation 
that  one  factor  is  usually  dominant,  it  seems  that  if  the  other  four  minor  factors  are 
still  pervasive  they  would  be  picked  up  in  Conway  and  Reinganum's  (1988)  cross- 
validation  technique. 
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away  the  idiosyncratic  risk.  Factor  analysis  limits  the  sample  size  and  makes  it  hard 
to  make  inferences  about  economy  wide,  pervasive  factors.  This  limitation  is 
addressed  in  the  next  section. 

3.3  Asymptotic  Principal  Components 

The  criticisms  of  DFG,  and  the  mixed  results  of  the  early  empirical  studies, 
illustrate  the  motivation  for  researchers  looking  at  alternatives  to  factor  analysis  in 
empirical  tests  of  the  APT.  Connor  and  Koraczyk's  (1986,  1988)  asymptotic 
principal  components  technique  is  perhaps  the  most  widely  used  of  these  alternatives. 
The  technique  of  asymptotic  principal  components  allows  one  to  treat  the  factors  as 
unobservables,  like  factor  analysis  does.  However,  it  allows  the  researcher  to 
overcome  some  of  the  computational  problems  associated  with  factor  analysis.  Much 
of  the  empirical  work  is  based  on  a  theoretical  result  from  Chamberlain  and 
Rothschild  (1983)  that  was  discussed  in  the  preceding  chapter. 

3.3.1  Theoretical  Foundations 

Principal  components  analysis  is  simply  another  method  of  extracting  the 
factors  from  returns  data.  The  primary  difference  between  principal  components  and 
factor  analysis  is  that  factor  analysis  estimates  the  idiosyncratic  risks  at  the  same  time 
it  estimates  the  factors;  principal  components  on  the  other  hand  ignores  the 
idiosyncratic  risks.  Principal  components  attempts  to  explain  as  much  as  possible  of 
the  entire  cross-sectional  variation  in  returns.  Factor  analysis  only  looks  at  the 
variation  common  to  all  securities.  Principal  component  factors  are  based  on  the 
eigenvectors  of  the  sample  covariance  matrix.  Therefore,  both  factor  analysis  and 
traditional  principal  components  require  working  with  an  nx  n  covariance  (or 
correlation  matrix).  Connor  and  Korajczyk's  (1986,  1988)  gain  in  efficiency  comes 
from  their  development  of  an  asymptotic  principal  components  technique  for 
extracting  the  factors  assuming  an  approximate  factor  structure. 
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With  an  approximate  factor  structure,  the  relationship  in  equation  (3.3) 
becomes: 

V  =  BB'  +  Zr  (3.8) 

where  B  is  the  matrix  of  factor  loadings,  2*  is  a  matrix  with  uniformly  bounded 
eigenvalues,  and  again  we  assume  that  the  factors  are  orthogonal  and  scaled  to  have 
unit  variance.  Note  however,  that  unlike  with  a  strict  factor  structure,  we  allow  2*  to 
have  off-diagonal  terms.  It  also  assumes  knowledge  of  the  actual  population 
covariance  matrix  of  asset  returns. 

Connor  and  Korajczyk  (1986)  first  show  that  statistics  based  on  the  observed 
sample  covariance  matrix  will  converge  to  the  statistics  based  on  the  population 
covariance  matrix.  More  importantly,  they  also  show  that  one  can  use  a  T  xT 
dimensional  matrix,  rather  than  the  normal  n  x  n  dimensional  sample  covariance 
matrix.  The  following  paragraphs  will  closely  follow  their  original  development.20 

First,  Connor  and  Korajczyk  (1986)  suggest  the  following  empirical 
specification.  They  start  with  equation  (3.5), 


Rt  =  B(Xt  +ft)  +  e, 

E[et  \ft]  =  0,  E[ft]  =  0,  E[ 6,6f]=S* 


(3.9) 


The  t  subscript  has  been  added  to  the  Vs  to  note  that  this  technique  allows  for  time 
varying  risk  premia.  Actually,  with  the  Fama-Macbeth  approach  used  by  Roll  and 
Ross  (1980),  one  could  argue  that  factor  analysis  also  allowed  for  time-varying  risk 
premiums.  Next,  assume  we  have  observed  returns  on  n  securities  for  t  periods. 

Remembering  that  R  f  are  excess  returns  (returns  in  excess  of  the  riskless 

rate),  the  observed  n  xT  matrix  of  excess  returns  is  given  by  RN  =  r"  -  lnrf/  where 
rn  is  the  n  xT  matrix  of  observed  returns,  and  rf  is  a  T-vector  of  observed  returns 


20 Again,  the  notation  is  changed  for  consistency. 
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on  the  riskless  asset.  The  risk-free  rate  can  also  vary  with  time.  From  equation  (3.9) 
we  have 

Rn  =  BnF  +  £n  (3.10) 

where  F  is  the  k  x  T  matrix  of  (\  +  Q  and  s"  is  the  n  x  T  matrix  of  realizations 
of  £t  and  BN  is  the  n  x  T  matrix  of  factor  loadings. 

Connor  and  Korajczyk  will  extract  the  principal  components  from  equation 
(3.10).  Factor  analysis,  on  the  other  hand,  uses  equation  (3.1)  and  extracts  the 

factors  from  the  sample  covariance  matrix  [(?”  ~  E)(rn  -  E)']  where  E  js 

estimated  with  the  time  series  mean  of  i".  This  is  a  critical,  but  often  overlooked, 
point.  Any  estimate  of  E  is  biased  by  the  time  series  mean  of  the  risk-premia,  as  can 
be  seen  by  equation  (3.2). 

Ehrhardt  (1987)  is  the  first  to  discuss  this  issue  in  detail.  He  argues  that  most 
studies  use  a  test  statistic,  based  on  the  estimated  errors,  to  determine  whether  there 
is  a  sufficient  number  of  factors.  In  most  of  the  factor  analysis  papers  these 
estimated  errors  come  from  equation  (3.1);  Ehrhardt  argues  they  should  come  from 
equation  (3.5).  Ehrhardt  offers  an  intuitive  explanation  as  to  the  differences  of  these 
approaches.  With  equation  (3.1)  the  intercept  term  Ej  is  different  for  each  asset,  but 
the  same  in  every  period.  With  equation  (3.5),  however,  the  intercept  is  the  risk-free 
rate  which  will  be  the  same  for  each  security  but  can  be  different  for  each  period.  So 
with  equation  (3.1)  we  can  always  drive  the  estimated  residuals  to  appear  as  if  they 
are  distributed  independently,  with  equation  (3.5)  however,  we  cannot. 

Connor  and  Korajczyk  (1986)  next  define 

Qn  =  (1/n)  Rn/R"  (3.11) 

which,  from  equation  (3.10)  is  equal  to 

Q"  =  (l/ri)F,Bn'  BnF  +  (1  ln)(F'Bnen  +  en  Bn  F)  +  (l/n)en/e" 

=  An  +  Yn  +  Zn 


(3.12) 
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Let  GN  be  the  observed  principal  components  matrix  of  CT  and  H"  be  the 
unobservable  principal  components  of  An.  Gn  is  a  k  x  T  matrix  of  the  k  eigenvectors 
associated  with  the  k  largest  eigenvalues  of  Q".  Connor  and  Korajczyk  show  that  H" 
will  be  a  non-singular  transformation  of  F  (which  is  what  they  want  to  extract). 

Next,  they  show  that  Y"  approaches  zero  as  n  increases,  and  Zn  approaches  cr2  IT  for 
large  n.  So  from  equation  (3.12) 

lim  G"  =  A  ”  +  a%  (3.13) 

n^oo 

and  since  the  eigenvectors  of  a  matrix  do  not  change  with  the  addition  of  a  scalar 
matrix,  the  eigenvectors  of  Q"  will  approach  those  of  A"  as  n  grows  large.  More 
formally,  they  prove, 

Gn  =  LnF  +  where  plim  =  0  (3.14) 

n-*°° 

Therefore  one  can  use  G"  as  an  estimate  of  the  factor  loadings  in  the  second  stage 
cross-sectional  regressions. 

This  technique  effectively  lets  a  researcher  estimate  the  factors  for  a  huge 
number  of  securities  without  having  to  break  them  into  groups  as  Roll  and  Ross 
(1980)  did.  However,  individual  Mests  will  still  not  be  valid  since  the  factors  are 
only  determined  up  to  a  non-singular  transformation.21  As  with  factor  analysis,  this 
technique  assumes  the  value  of  k  is  known.  Connor  and  Korajczyk  suggest  that  one 
feasible  test  for  the  correct  value  of  k  is  to  make  sure  that  the  first  k  eigenvalues  of 
Qn  continue  to  grow  as  n  increases,  whereas  the  last  T  -  k  eigenvalues  of  Q1  become 
equal  (bounded). 


21There  is  some  confusion  on  this  matter  in  the  literature.  Shukla  and  Trzcinka 
(1980)  state  that  since  the  approximate  factor  structure  is  unique,  up  to  a  scalar 
transformation,  individual  Mests  are  valid. 
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3.3.2  Empirical  Results  and  Extensions 
In  a  subsequent  paper,  Connor  and  Korajczyk  (1988)  use  their  asymptotic 
principal  components  technique  in  an  empirical  study.  They  first  offer  a  new 
procedure  for  estimating  the  matrix  G",  which  is  an  iterative  technique  similar  to  the 
use  of  weighted-least  squares  in  regression.  They  point  out  that  with  large  samples 
there  is  little  gain  from  this  technique  but  there  might  be  gains  with  small  samples. 
Presumably,  one  uses  the  asymptotic  principal  components  technique  to  take 
advantage  of  using  a  large  number  of  securities,  so  this  iterative  technique  does  not 
seem  too  important  and  will  not  be  used  in  this  research. 

Empirically,  they  look  at  estimating  factors  and  risk  premiums  for  four  non¬ 
overlapping  five  year  periods,  where  each  time  period  has  between  1487  and  1745 
securities  (as  compared  to  Roll  and  Ross's  (1980)  groupings  of  30  securities). 

Connor  and  Korajczyk  look  at  one  factor,  five  factor,  and  ten  factor  models.  They 
conclude  that  the  five  factor  model  is  the  most  parsimonious  and  it  performs  better 
than  a  standard  CAPM  model  in  explaining  anomalies  such  as  the  January-specific 
mispricing.  However,  even  with  the  APT,  the  size  effect  remains. 

Another  very  interesting  result  in  this  paper  is  Connor  and  Korajczyk' s  use  of 
simulation  to  show  that  asymptotic  principal  components  accurately  estimates  the 
factors.  This  approach  will  be  one  of  the  main  techniques  utilized  in  my  analysis  of 
factor  stability.  They  simulate  asset  returns  that  correspond  to  an  approximate  five 
factor  model  and  then  extract  the  factors  using  their  technique.  They  then  regress 
each  of  the  extracted  factors  on  the  full  set  of  true  factors.  Comparisons  cannot  be 
made  on  a  factor  to  factor  basis  because  Gn  converges  to  L"  F,  not  F.  They  find  that 
their  technique  works  well  even  if  the  idiosyncratic  risks  have  a  correlation  coefficient 
of  0.9  (where  a  strict  factor  structure  will  have  a  coefficient  of  zero,  and  an 
approximate  factor  structure  will  be  somewhere  between  zero  and  one). 
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Connor  and  Korajczyk  (1988)  also  propose  some  new  methods  for  testing 
restrictions  implied  by  the  approximate  factor  structure.  For  example,  let  "a" 
represent  a  vector  of  the  intercept  terms  from  the  cross-sectional  regressions  of  excess 
returns  on  the  factors;  the  theory  states  that  this  vector  should  be  exactly  zero.  They 
find  that  it  is  not  identically  zero  but  it  seems  to  perform  as  well  as  or  better  than 
similar  mispricing  tests  using  a  CAPM  model. 

Trzcinka  (1986)  looks  at  the  behavior  of  the  eigenvalues  from  a  sample 
covariance  matrix  as  the  number  of  securities  is  increased.  He  finds  that  only  the 
first  eigenvalue  grows  without  bound,  but  the  first  five  appear  to  be  growing  more 
distinct.  In  a  theoretical  paper.  Brown  (1989)  shows  that  the  evidence  shown  in 
Trzcinka  is  consistent  with  an  economy  with  k  equally  important  factors.22  Unlike, 
factor  analysis  however,  the  principal  components  solution  is  unique.  Brown  shows 
that  the  principal  components  solution  will  differ  from  the  original  factor  structure  by 
a  particular  rotation.  This  Helmert  rotation,  can  increase  the  significance  of  the  first 
estimated  factor  and  reduce  the  significance  of  the  remaining  factors.23 

McCulloch  and  Rossi  (1990)  have  a  very  interesting  paper  where  they  use 
asymptotic  principal  components  to  extract  the  factors.  They  use  over  2100  firms 
(NYSE  and  AMEX)  from  1986  to  1987.  This  represents  a  significantly  larger  sample 
than  most  of  the  other  papers  reviewed.  They  focus  on  developing  new  tests  for 
determining  the  adequacy  of  the  model.  Their  primary  test  is  a  Bayesian  estimation 
approach  that  looks  at  comparing  posterior  probabilities  on  the  null  (the  APT 
approximate  factor  structure  holds)  and  alternative  hypothesis  (the  APT  does  not 
hold),  based  on  the  estimated  factor  loadings  and  prices.  Their  analysis  shows 


22This  result  is  analogous  to  Brown  (1988)  discussed  in  Chapter  2. 

23The  Helmert  rotation  is  defined  in  Brown  (1989).  Brown  suggests  that  the 
extracted  factors  could  be  transformed  via  an  inverse  Helmert  rotation  to  obtain  the 
true  factors. 
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departures  from  the  model  restrictions.  However,  due  to  the  high  level  of  parameter 
uncertainty  (since  both  the  loadings  and  prices  are  estimated),  they  are  unable  to 
affirm  or  reject  the  APT.  They  also  mention  that  common  rules  for  determining  k, 
such  as  looking  for  clear  breakpoints  in  the  eigenvalues,  may  not  work.  They  find 
that  the  first  eigenvalue  is  significantly  larger  than  the  others,  but  after  that  they 
remain  fairly  close  together.  They  arbitrarily  looked  at  one,  three,  and  five  factor 
models  and  find  at  most  three  pervasive  factors. 

Another,  and  somewhat  more  appealing,  test  for  determining  k,  is  developed 
by  Connor  and  Korajczyk  (1993).  They  agree  with  McCulloch  and  Rossi  (1990)  in 
that  it  is  difficult  to  determine  k  based  on  the  behavior  of  the  eigenvalues.  They  note 
that  standard  likelihood  ratio  tests  will  not  work  unless  a  strict  factor  structure  is 
assumed.  When  they  are  used  with  approximate  factor  structures  they  show  that  the 
bias  will  be  positive  (the  tests  will  tend  to  identify  too  many  factors).  They  develop  a 
statistic  that  is  based  on  the  fact  that  if  the  correct  k  is  chosen,  there  should  be  no 
significant  decrease  in  the  cross-sectional  mean  square  of  idiosyncratic  returns  in 
moving  from  k  to  k  +  1  factors. 

Using  their  newly  developed  test  for  significance,  Connor  and  Korajczyk 
determine  that  from  one  to  six  factors  is  necessary.  The  first  factor  explains  most  of 
the  variation  but  the  second  through  sixth  factors  are  significant  and  explain  much  of 
the  variation  associated  with  returns  in  the  month  of  January.  They  therefore  argue 
for  a  three  to  six  factor  model.  As  with  their  previous  paper,  they  use  simulation 
data  to  show  that,  in  a  world  with  four  equally  important  factors:  1)  their  test  for  the 
number  of  factors  works  fairly  well,  and  2)  the  extracted  factors  are  equal  to  the  true 
factors  (up  to  a  non-singular  transformation).  This  is  important,  in  light  of  the  results 
of  Trzcinka  (1986)  and  Brown  (1989)  discussed  above. 
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3.3.3  Summary 

One  of  the  main  tests  of  this  dissertation  will  be  whether  or  not  security 
returns  are  generated  by  k  economy  wide,  or  pervasive,  factors.  Connor  and 
Korajczyk's  (1986,  1988)  asymptotic  principal  components  technique  has  several 
advantages  over  factor  analysis  in  conducting  such  a  test.  First,  it  allows  one  to  use  a 
large  number  of  securities,  and  diversification  is  critical  in  any  test  of  the  APT. 
Secondly,  simulation  has  shown  that  it  is  possible  to  determine  if  the  extracted  factors 
are  equal  to  the  true  factors. 

Connor  and  Korajczyk's  asymptotic  principal  components  technique  will  be 
used  to  compare  factors  extracted  from  mutually  exclusive  portfolios  of 
NYSE/AMEX/Nasdaq  stocks.  Unfortunately,  this  method  is  not  well  suited  for 
comparing  the  prices  of  the  extracted  factors.  First,  the  factors  are  still  subject  to  a 
linear  transformation,  making  it  impossible  to  compare  the  price  of  the  first  factor 
from  one  sample  to  the  price  of  the  first  factor  from  another  sample.  Second,  the 
traditional  Fama-MacBeth  approach  assumes  a  strict  factor  structure  in  the  second 
stage,  cross-sectional  regressions.  These  issues  will  be  addressed  in  the  following 
sections. 


3.4  Pre-Specified  Factors 

As  discussed  in  Chapter  2,  one  intuitive  way  to  tackle  the  empirical  problem 
of  factor  identification  is  to  simply  pre-specify  the  factors.24  Economic  theory  offers 
insight  as  to  what  factors  might  systematically  affect  the  return  on  a  security  and  the 
plethora  of  recorded  economic  variables  offer  observable  proxies  for  these  factors. 
Although  this  technique  is  nice  in  that  we  can  obtain  time  series  data  on  the 


24In  fact,  this  is  the  only  way  to  "identify"  the  factors  (and  then  we  are  only 
identifying  proxies  for  the  factors).  All  of  the  other  techniques  mentioned  earlier 
extract,  but  do  not  identify,  the  factors  from  the  data. 
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underlying  factors  it  is  also  dangerous  in  that  one  must  arbitrarily  choose  which 
variables  to  use.  As  previously  discussed,  it  also  opens  the  door  to  a  host  of 
econometric  issues.  However,  pre-specified  factors  allow  for  individual  f-tests  for 
significance.  These  t-tests  can  be  run  for  various  sub-samples  and  directly  compared 
to  determine  if  the  price  is  the  same.  For  this  reason,  they  will  be  used  to  provide  a 
cleaner  test  of  the  APT's  pricing  relationship. 

3.4.1  Initial  Results 

Chen,  Roll,  and  Ross  (CRR)  (1986)  is  considered  the  seminal  work  in  this 
area.  Several  studies  have  relied  on  their  basic  methodology.  This  section  will 
highlight  some  of  the  major  results.  The  following  section  will  look  at  various 
refinements  and  advances  to  their  technique.  CRR  propose  a  list  of  several  relevant 
macroeconomic  factors  and  then  identify  observable  proxies  for  each  of  them.  Each 
of  the  proxies  has  to  measure  the  unanticipated  movement  (or  innovation)  of  the 
relevant  factors.  If  one  simply  used  the  underlying  variable,  without  accounting  for 
expected  movement,  it  would  result  in  an  errors-in-variables  problem,  thus  rendering 
the  usual  statistical  tests  useless. 

CRR  point  out  that  there  are  two  logical  approaches  for  obtaining  the 
innovations.  First,  since  many  monthly  rate  of  return  observations  and  changes  in 
growth  rates  generally  show  little  serial  correlation,  they  could  be  used  directly  as 
innovations.  Alternatively,  one  could  specify  an  equation  (e.g.  an  autoregressive 
model)  for  expected  movement  and  use  the  residuals  as  the  innovations.  They 
suggest  that  there  is  a  tradeoff  between  the  errors  caused  by  using  monthly  growth 
rates  (which  might  not  have  completely  filtered  out  the  expected  movement)  and  the 
error  introduced  by  incorrectly  specifying  the  expected  movement  equation  if  one 
tries  the  alternative  approach.  CRR  choose  the  somewhat  simpler  approach  of  using 
changes  in  growth  rates  or  actual  growth  rates  as  the  innovations. 
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CRR  look  at  the  following  macroeconomic  variables  as  proxies  for  the 
underlying  factors:  industrial  production  (monthly  and  yearly),  inflation  (both 
unanticipated  inflation  and  changes  in  expected  inflation),  a  risk  premium  variable  (as 
measured  by  the  delta  between  low-grade  corporate  bonds  and  long-term  government 
bonds),  and  a  term  structure  variable  (as  measured  by  the  delta  between  long  term 
government  bonds  and  one  month  treasury  bills).  They  eventually  drop  the  yearly 
industrial  production  variable  due  to  its  high  correlation  with  monthly  industrial 
production.  They  also  look  at  both  the  CRSP  value  and  equally  weighted  market 
index,  real  consumption,  and  oil  prices. 

As  with  most  of  the  other  papers  discussed  in  this  review,  a  variant  of  the 
Fama-MacBeth  approach  is  used  in  their  study.  CRR  first  run  time  series  regressions 
of  the  asset's  returns  on  the  state  variables  to  estimate  the  factor  loadings  (or  beta 
values).  They  then  run  cross-sectional  regressions  of  asset  returns  on  the  estimated 
factor  loadings.  The  cross-sectional  regressions  were  run  month  by  month  for  several 
years  to  obtain  a  time-series  of  the  associated  risk  premiums  for  each  state  variable  or 
factor.  The  time-series  means  are  then  tested  by  a  /-test  for  significant  differences 
from  zero.  This  procedure  forces  a  strict  factor  structure  on  the  returns  in  assuming 
that  the  residuals  are  uncorrelated. 

CRR  group  the  securities  into  portfolios  to  partially  alleviate  the  errors-in- 
variables  problem  caused  by  using  estimated  factor  loadings  in  the  cross-sectional 
regressions.  In  order  to  obtain  good  dispersion  for  the  estimated  beta  values,  CRR 
grouped  the  portfolios  based  on  size,  since  size  is  known  to  be  related  to  returns.25 


25CRR  also  looked  at  various  other  schemes  (e.g.,  stock  price,  betas  on  a  market 
index,  etc.)  for  grouping  the  securities  into  portfolios.  They  found  that  the  grouping 
played  a  large  role  in  which  factors  were  priced.  This  is  discouraging  and  offers  a 
good  area  for  future  research. 
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CRR  first  look  only  at  the  macroeconomic  variables  and  find  that  the 
following  have  significant  risk  premiums  (for  the  entire  sample  period),  where  the 
sign  indicates  whether  the  risk  premium  is  positive  or  negative:  monthly  industrial 
production  (+),  risk  premium  (+),  and  unanticipated  inflation  (-).  They  also  find 
weak  significance  of  expected  inflation  (-),  and  term  structure  (-).  As  one  might 
expect,  there  were  differences  when  the  period  was  broken  into  sub-samples.  For 
example,  the  two  inflation  related  factors  were  highly  significant  between  1968  and 
1977  (a  period  of  high  inflation)  and  insignificant  in  all  other  periods. 

When  they  added  in  the  market  indices  (both  value  and  equally  weighted), 
neither  was  deemed  significant,  although  without  the  other  factors  they  were  both 
individually  found  to  be  significant.  This  seems  to  reject  the  CAPM  model  in  favor 
of  the  APT.  The  proxies  for  consumption  and  oil  prices  were  never  significant. 

CRR  thus  conclude  that  stock  returns  are  exposed  to  multiple  systematic  factors  that 
affect  their  returns  and  that  proxies  for  these  factors  can  be  found  using  simple 
financial  and  macroeconomic  theory. 

Hamao  (1988,  1992)  performs  an  empirical  study  similar  to  CRR  on  the 
Japanese  stock  market.  He  finds  that  expected  inflation,  term  structure,  and  a  risk 
premium  proxy  are  significantly  priced.  Monthly  production,  and  a  trade  term  proxy 
are  weakly  significant.  Changes  in  the  exchange  rate  and  oil  prices  are  not 
significantly  priced.  Interestingly,  he  finds  that  the  inflation  proxy  has  a  positive  risk 
premium,  whereas,  CRR  found  a  negative  risk  premium. 

Chan,  Chen,  and  Hsieh  (CCH)  (1985)  also  use  pre-specified  factors  in  a  paper 
that  examines  the  firm  size  effect.  Their  paper  looked  at  NYSE  stocks  from  1953  to 
1977.  CCH  use  the  same  macroeconomic  variables,  and  methodology,  as  CRR  in  the 
pricing  equation.  They  find  that  their  APT  model  does  capture  the  size  effect. 
Interestingly,  they  find  the  same  significant  factors  as  CRR.  This  is  encouraging 
considering  that  each  paper  is  based  on  overlapping  but  different  sample  periods. 
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Unfortunately,  both  papers  look  at  the  same  macroeconomic  variables.  So  even 
though  they  both  support  the  APT  we  don't  know  if  they  have  identified  all  of  the 
appropriate  factors. 

3.4.2  Econometric  Refinements 

There  is  an  extensive  volume  of  literature  that  looks  at  pre-specified  factors  in 
the  context  of  an  APT.  The  following  paragraphs  will  review  some  of  the  major 
contributions  and  advancements  in  this  area. 

Clare  and  Thomas  (1994),  using  data  from  the  London  stock  exchange, 
provide  another  important  work  in  this  area.  Clare  and  Thomas  start  by  looking  at 
19  proposed  variables,  they  estimate  the  risk  premiums  and  associated  t- values  and 
then  eliminate  the  most  insignificant.  They  then  reiterate  the  entire  process  until  they 
have  found  a  model  in  which  every  risk  premium  is  significant  at  the  10%  level. 

They  consider  the  resulting  model  to  be  the  most  parsimonious  available,  given  their 
list  of  variables. 

Clare  and  Thomas  (1994)  believe  that  CRR's  approach  does  not  truly  give  the 
innovation.  They  point  out  that  CRR's  innovations  are  still  autocorrelated  and 
therefore  not  innovations.  Clare  and  Thomas  use  autoregressive  models  to  provide 
the  innovations. 

Clare  and  Thomas  group  securities  into  portfolios  using  two  methods  (beta  and 
size  rankings).26  When  ordering  by  market  beta,  they  find  seven  priced  factors  priced 
(although  two  of  them  appear  to  be  only  weakly  significant).  When  they  sort  by 
market  size  (as  CRR  and  CCH  did)  they  find  only  two  priced  factors.  This  confirms 
the  sensitivity  of  results  to  the  grouping  scheme  as  reported  in  CRR  and  also  lends 
support  to  the  hypothesis  that  some  of  the  factors  are  sample  specific,  and  not 


26The  astute  reader  will  remember  that  CRR  discussed  the  sensitivity  of  their 
results  to  the  portfolio  grouping  scheme,  but  only  presented  results  from  one  case 
(market  size).  Clare  and  Thomas  present  results  from  both  schemes. 
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pervasive  factors.  Interestingly,  unlike  when  they  sorted  by  market  beta,  the  market 
index  does  have  explanatory  power  when  added  to  this  model. 

Clare  and  Thomas's  results  are  similar  to  those  of  CRR  and  CCH  with  two 
primary  exceptions:  1)  oil  prices  were  not  significant  in  the  other  papers,  and  2)  they 
find  a  positive  premium  on  inflation.  They  suggest  that  these  results  are  due  to  the 
level  of  "net"  oil  exports  in  the  UK  and  the  apparent  fact  that  UK  stocks  were  not  a 
hedge  against  inflation  during  the  period  studied. 

So  far,  most  of  the  papers  discussed  in  this  chapter  have  relied  on  the  Fama 
and  MacBeth  two  step  methodology.  As  previously  mentioned,  this  methodology 
involves  an  errors-in-variables  (EIV)  problem.  CRR  and  many  others  assume  that  the 
EIV  problem  is  virtually  eliminated  by  grouping  the  securities  into  portfolios  during 
the  second  stage  regression.  However,  Shanken  (1992b)  shows  that  after  a  correction 
is  made  to  the  ^-statistic  for  the  EIV  problem,  none  of  the  original  CRR  factors  are 
significant.  Clare,  Priestly,  and  Thomas  (1997)  make  this  same  correction  to  the  data 
in  their  original  study,  discussed  above,  and  also  find  that  many  of  the  factors  become 
insignificant.  Shanken  (1992b)  also  suggests  restrictions  on  those  proxies  specified  as 
differences  in  returns  (since  they  also  should  satisfy  the  pricing  relationship). 

Several  techniques,  have  been  developed  to  overcome  this  EIV  problem. 
McElroy  and  Burmeister  (1988)  develop  a  technique  based  on  iterated  nonlinear 
seemingly  unrelated  regressions  (ITNLSUR).27  Using  macroeconomic  proxies, 
similar  to  those  used  by  CRR,  they  test  a  linear  factor  model  that  admits  the  APT  and 
the  CAPM  as  special  cases.  By  estimating  the  factors  and  their  associated  prices 
simultaneously,  this  technique  avoids  the  EIV  problem.  It  also  assumes  an 
approximate  factor  structure.  Their  evidence  is  supportive  of  the  APT.  Brown  and 
Otsuki  (1992)  employ  the  same  methodology  in  a  study  using  Japanese  securities  and 

27McElroy  and  Burmeister  also  provide  an  excellent  discussion  of  the  econometric 
difficulties  associated  with  the  Fama-MacBeth  approach. 
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an  expanded  array  of  macroeconomic  variables.  They  find  evidence  of  five  or  six 
priced  factors.  The  primary  disadvantage  of  the  ITNLSUR  technique  is  that  it  is 
computationally  burdensome  and  allows  only  a  small  number  of  securities  to  be  used 
relative  to  asymptotic  principal  components. 

Priestley  (1996)  argues  that  both  the  rate  of  change  and  autoregressive 
approaches  to  obtaining  unanticipated  components  of  macroeconomic  variables  are 
flawed.  Specifically,  he  shows  that  the  rate  of  change  methodology  (used  by  CRR 
(1986)  and  others)  provides  autocorrelated  unexpected  components;  and  the 
autoregressive  methodology  (used  by  Clare  and  Thomas  (1994))  allows  agents  to 
make  systematic  forecast  errors.  Priestley  then  shows  that  the  estimated  risk  premia 
are  sensitive  to  the  measurement  of  the  unanticipated  components.  He  finds  that 
innovations  estimated  via  a  Kalman  filter,  do  not  violate  the  above  conditions,  and 
provide  a  better  description  of  expected  returns,  both  in  and  out  of  sample.28 

Kramer  (1994)  uses  a  multi-factor  APT  model  to  study  the  role  of 
macroeconomic  factors  in  explaining  the  January  effect.  The  empirical  significance 
of  January  for  stock  returns  is  highly  documented  in  the  literature.  Kramer  points  out 
that  any  explanation  of  this  phenomenon  must  specify  some  seasonality  in  expected 
returns.  Since  there  is  a  pervasive  sense  of  seasonality  in  the  economy, 
macroeconomic  variables  are  a  good  choice. 

Kramer  looks  at  a  five  factor  model  using  proxies  for  default  risk,  maturity 
risk,  inflation,  consumption  growth,  and  a  stock  market  proxy.29  Kramer  seasonally 
adjusts  the  inflation  and  consumption  factors  to  decrease  the  chance  that  the  factor 


28McElroy  and  Burmeister  (1988)  and  Burmeister  and  McElroy  (1988)  also  used 
Kalman  filter  techniques. 

29The  stock  market  proxy  is  a  time  series  of  residuals  of  a  regression  of  the 
equally  weighted  CRSP  index  on  the  other  four  factors.  So,  in  essence  the  model 
yields  the  CAPM  as  a  special  case. 
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mimics  the  anomaly  and  therefore  yields  a  false  positive  statistic.  Kramer  uses  the 
standard  Fama-MacBeth  method,  but  makes  an  adjustment  in  calculating  the  standard 
errors  to  compensate  for  the  errors  in  variables  problem.  He  concludes  that  the 
multifactor  APT  outperforms  the  standard  CAPM  and  finds  no  evidence  for  a  January 
effect  in  excess  returns  from  his  multifactor  model. 

Elton,  Gruber,  and  Blake  (1995)  [EGB]  look  at  an  APT  model  for  explaining 
returns  in  the  bond  market.  They  use  bond  indexes  as  well  as  macroeconomic 
variables  as  proxies  for  the  factors.  As  always,  the  models  require  using 
unanticipated  changes  or  innovations  in  the  economic  variables.  One  of  the  unique, 
and  very  appealing,  contributions  of  this  paper  is  the  use  of  publicly  available 
forecasts  to  measure  unexpected  changes  in  expectations  for  some  of  their  proxies. 
This  expectational  data  is  probably  a  closer  proxy  for  the  factors  that  actually  drive 
returns.  Unfortunately,  this  data  is  somewhat  scarce  and  generally  only  available  for 
later  time  periods. 

EGB  look  at  a  seven  factor  model  and  find  that  the  two  factors  based  on 
expectational  data  (unanticipated  changes  in  inflation  and  gross  national  product) 
explain  a  large  portion  of  the  return  variance  and  are  significant  at  the  1%  level. 
Another  interesting  point  in  EGB's  study  is  empirical  evidence  that  the  market  return 
indices  are  the  most  important  variables  in  explaining  the  time  series  of  returns  but 
the  macroeconomic  variables  lead  to  a  large  improvement  when  looking  at  the  cross- 
section  of  expected  returns. 

Gangopadhyay  (1996)  uses  a  model  similar  to  CRR  to  show  that  the  seasonal 
mean  reversion  in  stock  portfolio  returns  is  related  to  the  macroeconomic  proxies. 

The  explained  portion  of  the  returns  exhibit  January  mean  reversion  and  the 
unexplained  returns  do  not. 

Most  of  the  previous  papers  have  looked  at  ways  to  improve  the  proxies  or 
methodology  that  CRR  used.  Other  researchers  have  explored  new  areas  for  finding 
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proxies.  Young,  Berry,  Harvey,  and  Page  (1987,  1991)  look  at  the  ability  of 
financial  statement  variables  to  forecast  betas  in  an  APT  sense.  Booth  and  Booth 
(1997)  look  at  the  impact  of  monetary  policy  on  security  returns.  Specifically,  they 
fmd  that  the  federal  funds  rate  and  an  index  based  on  the  change  in  the  discount  rate 
are  significantly  priced.  Jorion  (1991)  provides  weak  evidence  that  exchange  rate  risk 
is  not  a  priced  factor  in  an  APT  setting.30  Conover  (1997)  however  uses  a  larger 
number  of  exchange  rates  and  foreign  interest  rates  and  finds  several  are  priced. 
Chow,  Lee,  and  Solt  (1997)  also  find  evidence  of  priced  exchange  rate  risk  within  the 
context  of  a  Fama-French  (1993)  model. 

3.4.3  Summary 

Numerous  papers  have  shown  that  there  is  a  direct  link  between  various 
macroeconomic  variables  and  the  expected  returns  of  securities.  By  using  these 
variables  as  proxies  for  the  underlying  factors  we  assign  some  economic  meaning  to 
the  APT.  Many  researchers  advocate  these  techniques  because  they  avoid  the 
econometric  difficulties  associated  with  extracting  factors  from  the  data. 

Unfortunately,  they  often  overlook  the  fact  that  the  use  of  these  variables  incorporates 
even  more  econometric  and  theoretical  challenges  (e.g.,  multicollinearity,  arbitrary 
choice  of  variables,  timing  issues,  computing  actual  innovations,  etc.).  However,  this 
technique  allows  one  to  estimate  risk  premia  using  a  large  number  of  securities  and 
then  directly  compare  the  risk  premiums  estimated  from  different  portfolios.  As 
such,  this  technique  will  be  used  to  test  the  stability  of  the  risk  premiums. 


30Many  recent  papers  have  documented  relationships  between  stock  prices  and 
exchange  rates.  See  for  example  Ajayi  and  Mougue  (1996)  and  Bartov,  Bodnar,  and 
Kaul  (1996). 
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3.5  Comparing  the  Various  Techniques 

Asymptotic  principal  components  appears  to  offer  econometric  advantages 
over  factor  analysis.  Macroeconomic  proxies  offer  intuitive  appeal  and  allow  one  to 
compare  risk  premia  across  groups.  Numerous  researchers  have  compared  the 
various  techniques  from  both  a  theoretical  and  empirical  standpoint.  This  section  will 
briefly  review  the  important  findings. 

From  a  theoretical  standpoint,  the  primary  difference  among  the  various 
techniques  is  the  assumption  of  either  a  strict  or  approximate  factor  structure.  Factor 
analysis  assumes  a  strict  factor  structure,  whereas  asymptotic  principal  components 
assumes  an  approximate  factor  structure.  The  Fama-MacBeth  two  step  approach,  as 
used  in  most  of  the  macroeconomic  factor  models,  also  assumes  a  strict  factor 
structure  in  the  second  step  cross-sectional  regressions. 

Burmeister  and  McElroy  (1988)  look  at  the  empirical  implications  of  the 
assumed  factor  structure  using  a  model  with  both  pre-specified  and  unobservable 
factors.31  They  develop  two  new  multivariate  approaches  to  complement  their 
ITNLSUR  approach,  developed  in  McElroy  and  Burmeister  (1988)  and  discussed 
above.  One  of  these  techniques,  iterated  nonlinear  weighted  least  squares  (INLSLS), 
is  analogous  to  INLSUR  with  the  notable  exception  that  it  allows  one  to  force  a  strict 
factor  structure.  They  find  similar  results  for  these  two  techniques  and  argue  that,  at 
least  for  the  set  of  factors  they  used,  the  choice  of  factor  structure  is  insignificant. 

Burmeister  and  McElroy  (1988)  make  another  valuable  contribution  to  the 
literature  in  this  paper.  They  point  out  that  in  empirical  asset-pricing  studies  it  is 
common  to  assume  zero  residual  risk  on  portfolios  used  to  represent  factors 
(mimicking  portfolios)  and  therefore  these  portfolios  can  be  used  as  exogenous 
variables  in  estimating  risk  premiums.  Their  third  technique,  iterated  nonlinear  three 


31They  use  a  mimicking  portfolio  technique  for  the  unobservable  factors. 
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stage  least  squares  (ITNL3SLS),  allows  for  non-zero  residual  risk.  They  find  that 
this  model  provides  different  results  from  either  of  the  other  techniques. 

Huang  and  Jo  (1992),  building  on  Grinblatt  and  Titman's  (1985)  theoretical 
paper,  demonstrate  that  a  strict  factor  structure  and  an  approximate  factor  structure 
coverage  to  one  another  as  the  number  of  securities  is  increased.  Specifically  they 
show  that  as  the  number  of  assets  increase,  an  appropriately  rotated  matrix  of  factor 
loadings  estimated  from  factor  analysis  converges  to  the  matrix  of  eigenvalues  from 
principal  components. 

Garrett  and  Priestley  (1997),  using  the  techniques  of  Burmeister  and  McElroy 
(1988),  investigate  the  factor  structure  in  relation  to  returns  on  securities  from  the 
London  Stock  Exchange.  They  argue  that  these  returns  are  best  described  by  an 
approximate  factor  structure  and  they  find  six  or  seven  priced  factors.  More 
importantly,  they  argue  that  Shanken's  (1992)  critique  of  Chen,  Roll,  and  Ross's 
(1980)  results  is  not  evidence  against  the  APT,  but  rather  evidence  that  the  wrong 
assumption  was  made  regarding  the  idiosyncratic  covariance  matrix.  Clare,  Priestley, 
and  Thomas  (1997)  reinforce  these  findings  by  reexamining  the  same  data  used  in 
Clare  and  Thomas  (1994).  When  recalculating  their  results  with  Shanken’s  EIV 
correction  or  using  Burmeister  and  McElroy's  ITNL3SLS  approach  that  assumes  a 
strict  factor  structure,  but  does  not  suffer  from  the  EIV  problem,  they  find  no 
significant  factors.  When  they  allow  for  an  approximate  factor  structure,  they  find 
five  priced  factors.32 

Shukla  and  Trzcinka  (1990)  compare  the  technique  of  asymptotic  principal 
components  with  that  of  factor  analysis.  Their  motivation  is  concern  that  the  two  are 
equivalent  only  if  the  idiosyncratic  risks  of  all  firms  are  equal  (which  one  reasonably 
assumes  is  not  the  case).  Factor  analysis  is  less  constrained  than  principal 

32The  authors  point  out  that  these  findings  should  also  apply  to  tests  of  the  CAPM 
and  might  shed  some  light  on  the  empirical  debate  on  the  "death  of  Beta" . 
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components  because  it  estimates  the  idiosyncratic  risks  at  the  same  time  it  estimates 
the  factors.  Principal  components  ignores  the  idiosyncratic  risks.  Therefore,  Shukla 
and  Trzcinka  posit  that  factor  analysis  might  outperform  principal  components.  They 
find  that  a  five  factor  model  based  on  principal  components  does  at  least  as  well  as 
the  five  factor  model  based  on  factor  analysis  in  terms  of  R-squared  values  and 
closeness  of  the  intercept  term  to  zero.  They  find  that  either  model  outperforms  a 
one  factor  CAPM  model.  Their  five  factor  model  was  able  to  explain  almost  40%  of 
the  variation  in  excess  returns. 

Huang  and  Jo  (1995)  look  at  whether  the  number  of  factors  tends  to  vary  with 
different  data  frequencies.  Monthly,  weekly,  and  daily  returns  are  used  to  extract 
factors  using  both  asymptotic  principal  components  and  factor  analysis.  They  find 
that,  after  correcting  the  daily  data  for  problems  due  to  nonsychronous  trading,  the 
number  of  factors  is  stable  across  all  frequencies.  The  results  find  only  one  or  two 
significant  factors. 

Another  popular  line  of  research  is  aimed  at  comparing  or  relating  the 
extracted  factors  to  macroeconomic  proxies.  In  my  opinion,  if  the  APT  holds,  this  is 
a  very  valuable  line  of  research.  In  essence,  by  extracting  the  factors  we  avoid 
having  to  make  arbitrary  choices  about  which  macroeconomic  variables  to  use,  yet  we 
give  the  extracted  factors  an  intuitive  meaning  by  relating  them  to  economic 
variables. 

One  of  the  first  papers  to  attempt  to  provide  a  link  between  extracted  factors 
and  other  variables  was  Fogler,  John,  and  Tipton  (1981).  They  argue  that  if  the 
CAPM  held  in  some  economy,  one  could  still  extract  factors  from  a  sample  of  100 
securities  and  find  a  second,  significant  factor  that  might  be  due  to  random  shocks  on 
several  stocks.  If  the  factors  could  be  shown  to  consistently  be  linear  combinations  of 
economic  variables,  then  it  would  make  a  strong  case  for  being  a  systematic  factor. 
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Their  empirical  analysis  is  crude  and  attempts  to  relate  three  extracted  factors  to  a 
broad  stock  market  index,  a  U.S.  Treasury  bill  index,  and  a  utility  stock  index. 

Connor  and  Korajczyk  (1988)  provide  some  comparisons  between  their 
derived  factors  and  some  of  the  economic  factors  used  by  Chen,  Roll,  and  Ross 
(1986).  They  regress  the  excess  returns  of  junk  bonds  (JBRET)  and  long  term 
government  bonds  (UTS)  on  their  factor  estimates.  The  two  variables  (JBRET  and 
UTS)  are  related  to  the  risk  premium  and  term  structure  variables  used  by  CRR. 
Connor  and  Korajczyk  find  that  their  first  factor  explains  7%  to  40%  of  the  variance 
in  JBRET.  With  five  factors  it  explains  up  to  59%  of  the  variance.  For  UTS  their 
first  factor  explains  0%  to  11%  and  with  five  factors  up  to  49%  of  the  variance.  The 
high  correlation  suggests  that  the  two  sets  of  factors  are  related. 

Kim  and  Wu  (1987)  take  a  somewhat  different  approach  in  relating 
macroeconomic  variables  to  APT  factors.  First  they  obtain  time-series  data  on 
approximately  15  macroeconomic  variables.  Next,  they  extract  factors  directly  from 
the  macroeconomic  data  and,  using  a  PROMAX  rotation,  identify  the  most  significant 
variables  in  each  factor.33  Kim  and  Wu  find  that  three  factors  account  for  88%  of  the 
variation  in  all  economic  variables.  They  then  use  these  extracted  factors  in  an  APT 
setting.  Taking  this  methodology  one  step  further,  Cho  and  Pak  (1991)  use 
interbattery  factory  analysis  to  extract  13  factors  that  are  common  between  a  group  of 
security  returns  and  a  group  of  macroeconomic  variables.  Starting  with  98 
macroeconomic  variables  they  are  able  to  rotate  the  initial  factor  loadings  and  obtain 
macroeconomic  interpretations  on  10  out  of  the  13  factors.  In  a  recent  paper,  Zhou 
(1996)  develops  a  generalized  method  of  moments  (GMM)  technique  that  estimates 
what  linear  combination  of  economic  variables  is  best  able  to  forecast  unobservable 
factors. 

33A  PROMAX  roation  seeks  to  rotate  the  factors  in  a  manner  that  each  one 
contains  only  a  few  highly  loaded  variables  -  thus  making  it  easier  to  interpret  them. 
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3.6  Various  Other  Techniques  and  Contributions 

Their  are  numerous  other  techniques  for  estimating  and  testing  the  APT  that 
can  be  found  in  the  literature.  Most  of  these  techniques  rely  on  advanced 
econometric  techniques.  For  completeness,  this  section  will  briefly  review  some  of 
the  more  prominent  areas. 

Mei  (1993a,  1993b)  develops  a  semiautoregressive  technique  (SAR).  The 
intuition  behind  this  technique  is  that  past  returns  can  be  used  as  proxies  for  the 
factor  loadings  because  they  span  the  same  return  space  as  the  betas.  In  this  sense 
the  technique  is  similar  to  the  technique  of  substituting  mimicking  factor  portfolio 
returns  for  unobservable  factors.  One  major  advantage  of  this  technique  is  that  it 
provides  an  asymptotic  covariance  matrix  for  the  factor  estimates. 

Mukherjee  and  Naka  (1995)  apply  a  six  factor  model  to  the  Japanese  stock 
market.  They  employ  a  vector  error  correction  which  has  advantages  in  its  ability  to 
explore  dynamic  co-movements  among  the  variables  examined  even  with 
nonstationary  data.  Mukherjee  and  Naka  choose  six  macroeconomic  factors  and 
simply  use  the  first  difference  of  natural  logs  to  approximate  the  innovations.  For  the 
period  1971  to  1990  they  find  that  all  of  the  variables  are  cointegrated  with  the 
market  returns. 

Bansal  and  Viswanathan  (1993)  argue  that  the  main  theoretical  and  empirical 
result  of  the  APT  is  the  existence  of  a  nonnegative  pricing  kernel.  They  suggest  that 
the  linear  relationship  between  the  factors  and  the  prices  is  overly  restrictive.  They 
develop  a  GMM  approach  to  estimate  the  pricing  kernel  and  find  that  their  results 
reject  a  linear  APT  but  support  a  nonlinear  model.  In  a  companion  paper,  Bansal, 
Hsieh,  and  Viswanathan  (1993)  apply  the  methodology  to  international  data  and  find 
further  evidence  supporting  their  nonlinear  model.  Similarly,  Abken,  Jarrow,  and 
Madan  (1996)  also  propose  a  nonlinear  APT  and  use  options  data  to  test  the  validity 
of  the  APT. 
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These  nonlinear  APT  models  are  similar  in  nature  to  conditional  APTs. 
Numerous  papers  have  looked  at  conditional  APT  models  which  allow  for  time 
varying  betas  and  risk  premiums.  Examples  include:  Engle,  Ng,  and  Rothschild 
(1990),  Koutoulas  and  Kryzanowski  (1996),  He,  Kan,  Ng,  and  Zhang  (1996),  and 
Kryzanowski,  Lalancette  and  To  (1997).  The  econometrics  for  these  models  are 
increasingly  complex  and  the  evidence  is  mixed.  In  a  recent  paper,  Ghysels  (1997) 
argue  that  the  dynamics  affecting  time  varying  betas  are  not  well  understood  and  if 
they  are  misspecified  the  pricing  errors  may  be  larger  than  in  a  constant  beta  models. 
They  give  empirical  examples  where  the  pricing  error  is  indeed  larger. 

Finally,  several  recent  papers  have  developed  sophisticated  new  techniques  for 
testing  asset  pricing  models.  Geweke  and  Zhou  (1996)  develop  a  Bayesian 
framework  for  measuring  APT  pricing  deviations.  Epps  and  Kramer  (1996)  develop 
a  test  that  is  based  on  how  the  cross-sectional  variance  of  returns  should  depend  over 
time  on  the  factor  realizations.  Velu  and  Zhou  (1996)  propose  a  GMM  technique  for 
testing  multifactor  models  that  is  valid  under  very  minimual  assumptions. 

Interestingly,  all  of  these  papers  reject  the  APT  in  favor  of  the  CAPM. 

3.7  Summary 

The  literature  on  the  APT  is  mixed.  The  APT  appears  to  explain  more  of  the 
cross-sectional  variation  in  asset  returns  and  it  explains  some  of  the  anomalies 
associated  with  the  CAPM.  However  there  is  still  much  uncertainty  regarding  the 
number  and  nature  of  the  factors.  While  there  has  been  some  evidence  that  the 
factors  might  be  sample  specific,  there  are  no  conclusive  results.  Previous  simulation 
results  have  shown  that  Connor  and  Korajczyk's  asymptotic  principal  components 
technique  is  an  effective  method  for  extracting  factors.  I  hope  to  show  that  their 
technique  can  also  be  used  to  directly  compare  factors  extracted  from  different 
portfolios.  This  concludes  the  review  of  the  current  literature.  The  following  chapter 


will  develop  the  testable  hypotheses  and  methodology,  reviewing  previous  work 
necessary. 


CHAPTER  4 


TESTABLE  HYPOTHESES  AND  METHODOLOGY 


This  chapter  develops  my  testable  hypotheses  and  the  methodology  I  will  use 
to  test  them.  First,  I  provide  a  brief  review  of  the  empirical  specification. 

This  empirical  specification  of  the  APT  closely  follows  Connor  and  Korajczyk 
(1988)  and  relies  on  Connor's  (1984)  Equilibrium  version  of  the  APT.  As  discussed 
in  Chapter  2  (and  presented  again  for  ease  of  the  reader)  this  version  of  the  APT 
assumes  that  the  random  returns  of  securities  are  generated  by  a  linear  k  factor  model 
of  the  following  form. 

ft  =  Et  +  Bl  + 

E\ft 1  =  0 
E[et\ft]  =  0 

ET“  “^1  _  VI 


where  E,  is  an  n- vector  of  expected  returns,  B  is  a  n  x  k  matrix  of  the  factor  loadings, 
**  is  a  k-vector  of  pervasive  factors,  and  ^  is  a  n- vector  of  idiosyncratic  returns.  I 


assume  an  approximate  factor  structure  so  that  Sr  is  not  constrained  to  be  diagonal. 

If  a  risk-free  rate  exists  and  random  returns  follow  a  model  as  in  equation 
(4.1),  the  equilibrium  version  of  the  APT  gives  us: 

Et  =  Xq  B 
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where  f<  is  the  risk  free  rate,  E,  is  the  expected  return,  ln  is  a  n-vector  of  ones,  and 

Xt  is  a  k- vector  of  the  risk  premiums  (or  price  of  the  risk).  Combining  equations 
(4.1)  and  (4.2)  yields: 

ft  ~  rf}n  =  B(ft  +  K)  =  (4.3) 

with  a  time-series  of  T  return  observations  for  each  of  the  n  securities,  equation  (4.3) 
is  written  as 

Rn  =  BnF  +  en  (4.4) 

where  R"  is  the  n  x  T  matrix  of  asset  excess  returns  (returns  in  excess  of  the  risk  free 

rate),  B  is  a  n  x  k  matrix  of  factor  loadings,  F  is  the  k  x  T  matrix  of  ^  +  ^ 
values,  and  e"  is  a  n  x  T  matrix  of  realized  idiosyncratic  error  terms. 

4.1  Testable  Hypotheses 

The  testable  hypotheses  come  directly  from  the  above  equations.  Tests  of  the 
pricing  implication  of  the  APT  equation  (4.2)  are  subject  to  several  criticisms. 
Primarily,  with  the  equilibrium  version,  any  test  of  equation  4.2  is  a  joint  test  that 
some  portfolio  is  mean- variance  efficient  relative  to  the  assumed  factor  structure. 

This  statement  is  somewhat  less  critical  than  the  similar  critique  of  the  CAPM 
provided  by  Roll  (1977)  which  states  that  a  specific  portfolio  (the  market  portfolio)  is 
globally  mean-variance  efficient.  Also,  with  an  approximate  factor  structure, 
statistical  tests  based  on  a  factors  price  are  very  difficult.  In  general  one  needs  to 
invert  a  sample's  covariance  matrix  to  compute  the  appropriate  test  statistic.  This 
task  is  computationally  burdensome  for  very  large  portfolios  and  unfeasible  for  the 
normal  case  where  the  number  of  assets  is  greater  than  the  number  of  time  periods. 
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Tests  of  the  linear  factor  generating  structure  avoid  some  of  these  difficulties. 
If  the  factor  structure  is  rejected  then  it  is  not  necessary  to  test  the  pricing 
relationship  as  it  will  not  hold.  If  the  assumption  can  not  be  rejected,  test  of  the 
pricing  implications  are  necessary. 

The  expected  return  of  each  asset  is  an  unknown  parameter,  therefore,  it  is 
impossible  to  directly  estimate  the  factors  from  equation  (4.1).  One  could  use  the 
time-series  mean  of  the  returns  as  an  estimate  but  this  will  introduce  a  bias  of  an 
unknown  magnitude  and  direction.  Equation  (4.4)  allows  one  to  extract  the  time- 
series  of  the  factor  plus  its  associated  risk  premium  (F  in  equation  (4.4)). 
Unfortunately  it  is  impossible  to  separate  these  two  parameters  and  the  test  becomes  a 
joint  test  of  the  equality  of  the  factors  and  their  associated  risk  premium  or  price. 

The  equilibrium  expected  return  relationship  in  equation  (4.4)  reveals  that  the 
price  of  a  factor  is  not  dependent  on  the  underlying  security  (i.e.,  the  price  is 
associated  with  the  factor  not  the  security).  If  a  factor  is  pervasive  it  affects  all  assets 
and  the  price  of  the  factor  is  the  same  for  all  assets.  Therefore,  if  I  draw  two 
random  samples  of  asset's  from  the  same  population  and  time  period  the  matrix  F  in 
equation  (4.4)  will  be  the  same.  More  specifically,  equation  (4.4)  for  sample  one  and 
two  respectively  is: 

IV  =  Bj"  Fj  +  Cj" 

R2n  =  B2n  F2  +  e2n  (4.5) 

from  which  I  obtain  my  first  hypothesis: 

HYPOTHESIS  #1: 

H0:  Fj  =  F2 
Ha:  F,  *  F2 

This  hypothesis  jointly  tests  the  equality  of  the  factors  and  the  associated 
prices.  I  would  like  to  test  only  the  equality  of  factors  but  can  not  extract  the  factors 
independently  of  the  prices.  Therefore  as  a  check  for  robustness  I  check  the  equality 
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of  prices  estimated  from  different  samples.  If  the  factor  is  pervasive  the  price  will  be 
the  same.  If,  on  the  other  hand,  the  factor  is  not  pervasive  and  only  affects  one  of 
the  samples,  the  price  need  not  be  the  same.  The  price  of  a  factor  can  vary  over  time 
but  by  looking  at  the  time-series  mean  from  equation  (4.2)  I  obtain  my  second 
hypothesis: 

HYPOTHESIS  #2: 

iv  *(*J)  =  *a?) 

H .  £(*,')  *  E(il) 

xxa* 

*  1  *  S'  2 

where  Kt  ana  Kt  are  (k  +  1)  vectors  of  the  first  and  second  samples  price's 

respectively.  The  first  element  is  \  and  should  be  equal  to  the  risk-free  or  zero  beta 
rate. 

4.2  Methodology 

This  section  discusses  my  methodology  for  testing  the  hypotheses  developed  in 
the  preceding  section.  I  will  also  compare  and  contrast  my  methods  with  those  of 
past  researchers  and  highlight  the  unique  contributions  of  this  research. 

4.2.1  Hypothesis  #1  -  Pervasiveness  of  Factors 

The  primary  contribution  of  this  research  is  the  development  of  a  methodology 
that  allows  for  a  direct  comparison  of  factors  extracted  from  separate  groups  of 
security  returns.  Very  few  researchers  have  looked  at  this  issue. 

Brown  and  Weinstein  (1983)  base  their  comparison  on  a  set  of  factor's  ability 
to  explain  the  cross-sectional  variation  in  returns.  Their  test  does  not  compare  each 
individual  factor  and  therefore  offers  little  insight  as  to  whether  the  same  factors  are 
generating  the  returns.  Cho  (1984)  uses  inter-battery  factor  analysis  and  forces  the 
factors  to  be  common  across  groups.  By  analyzing  the  factors  common  to  a  large 


58 


number  of  groups  (using  two  groups  at  a  time),  Cho  finds  that  the  number  of  factors 
between  groups  is  stable.  Again,  however,  there  is  no  guarantee  that  it  is  the  same 
set  of  factors  that  are  common  between  each  of  the  pairs.  Conway  and  Reinganum 
(1988)  use  cross-validation  techniques  to  determine  if  certain  factors  are  sample 
specific.  They  find  that  only  the  first  one  or  two  factors  extracted  from  a  sample 
improve  on  the  out-of-sample  explanatory  power.  Conway  and  Reinganum,  however, 
base  their  results  on  a  very  limited  analysis. 

My  methodology  is  unique  in  several  key  areas.  First,  I  will  use  sample  sizes 
in  the  range  of  1500  to  3000  securities.  All  of  the  papers  discussed  above  used  factor 
analysis  and  therefore  were  constrained  to  sample  sizes  of  30  to  60  securities.  This  is 
an  important  issue  given  the  asymptotic  nature  of  the  APT.  It  is  not  surprising  that  in 
a  small  portfolio  an  idiosyncratic  shock  to  one  or  two  securities  might  be  picked  up  as 
a  significant  factor.  The  likelihood  of  such  an  occurrence  decreases  as  the  sample 
size  increases.  Secondly,  the  previous  comparisons  assumed  a  strict  factor  structure, 
whereas  I  will  assume  an  approximate  factor  structure.  Garret  and  Priestley  (1997), 
among  others,  provide  evidence  that  security  returns  are  best  described  by  an 
approximate  factor  structure.  More  importantly,  Garret  and  Priestly  also  show  that 
empirical  results  are  sensitive  to  the  assumed  factor  structure  (strict  versus 
approximate). 

My  methodology  is  based  on  a  regression  technique  developed  in  Connor  and 
Korajczyk  (1988,  1993).  Connor  and  Korajczyk's  asymptotic  principal  components 
technique  extracts  factors  that  are  linear  combinations  of  the  true,  but  unknown, 
factors.  With  a  monte-carlo  evaluation  of  simulated  returns  data  they  demonstrate 
that  the  extracted  factors  can  be  compared  to  the  true  factors  by  looking  at  the  R- 
squared  values  obtained  in  a  regression  of  each  extracted  factor  (one  at  a  time)  on  the 
full  set  of  true  factors.  I  extend  this  methodology  to  compare  extracted  factors  across 
sub-samples.  I  use  extensive  simulations  to  evaluate  what  the  R-squared  values  will 
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look  like  under  a  variety  of  proposed  factor  structures.  I  then  evaluate  the  R-squared 
values  obtained  from  regressing  factors  extracted  from  real  world  data  on  factors 
extracted  from  a  different  sample  of  real  world  data.  Based  on  the  simulated  data  I 
can  then  make  inferences  about  whether  or  not  the  evidence  supports  or  rejects  the 
first  hypothesis. 

As  developed  in  Chapter  3,  Connor  and  Korajczyk  develop  a  technique  for 
extracting  the  factors  from  the  following  T  xT  product  matrix 

Q"  =  |lW  Rn 

W  (4.6) 

Connor  and  Korajczyk  then  prove  that 

Gn  =  LnF  +  d>"  (4.7) 

where  G"  is  an  orthonormal  kxT  matrix  whose  columns  are  the  first  k  eigenvectors 
of  Q",  L"  is  a  nonsingular  matrix,  and  O"  converges  in  probability  to  the  zero 
matrix.34  I  use  the  IMSL  routine  EVESF  to  extract  the  k  largest  eigenvalues  and  their 
associated  orthonormal  eigenvectors  from  the  cross-product  matrix  in  equation  (4.6).35 

The  extracted  factors  can  not  be  compared  on  a  one-to-one  basis  with  the  true 
factors  due  to  the  rotational  indeterminacy  (as  seen  by  L"  in  equation  (4.7)). 

However,  Connor  and  Korajczyk  (1988,  1993)  show  that  equation  (4.7)  provides  a 
method  for  evaluating  the  extraction  technique.  Equation  (4.7)  shows  that  each  of  the 
extracted  factors  is  a  linear  combination  of  the  true  factors  plus  an  error  term  (the  <£> 
matrix)  that  approaches  zero  in  the  limit.  So  if  one  regresses  each  of  the  extracted 


34Connor  and  Korajczyk  provide  a  proof  of  this  is  in  their  1986,  1988,  and  1993 
papers.  Perhaps  the  clearest  development  can  be  found  in  Theorem  1  of  their  1988 
paper. 

35With  a  few  exceptions,  all  the  data  manipulation  and  testing  is  done  in 
FORTRAN  making  extensive  use  of  the  IMSL  (MATH  and  STAT)  routines. 
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factors,  one  at  a  time,  on  the  full  set  of  true  factors  the  I?  values  should  approach 
one  as  n  approaches  infinity  (this  technique  will  be  referred  to  as  the  factor  regression 
technique).  The  R2  value  will  deviate  from  one  for  small  samples  because  of  the 
error  term.  Connor  and  Korajczyk  use  simulated  data  to  demonstrate  the 
effectiveness  of  the  technique  for  various  factor  structures. 

I  essentially  repeat  Connor  and  Korajczyk' s  analysis  for  my  data  sample  with 
three  primary  differences:  I  look  at  various  sample  sizes;  I  expand  on  the  number  of 
factor  structures  analyzed;  and  I  simulate  the  returns  based  on  a  ^-factor  structure  and 
then  extract  ( k  +  2)  factors.  These  simulations  will  show  that  the  extraction 
technique  works  and  is  robust  to  the  underlying  factor  structure.  The  simulations  also 
provide  evidence  on  how  large  the  samples  need  to  be  to  effectively  ignore  the  error 
term  and  show  that  the  technique  can  distinguish  "true"  factors  from  false  factors  (the 
extra  two  factors  that  were  extracted).  Details  of  the  analysis  and  each  of  the 
proposed  factor  structures  are  provided  in  my  Chapter  6. 

The  most  significant,  and  unique,  contribution  of  my  research  is  using 
equation  (4.7)  to  compare  sets  of  factors  extracted  from  different  samples.  From 
equation  (4.7),  let 


G,"  =  Ljn  F  +  fly 

(4.8) 

G2n  =  V  F  +  02n 

(4.9) 

for  samples  one  and  two  respectively.  I  assume  that  n  is  sufficiently  large  so  that  the 
error  term  in  equation  (4.8)  is  negligible  (this  assumption  will  be  supported  with 
evidence  from  simulations).  Since  L>n  is  nonsingular,  I  solve  equation  (4.8)  for  F  and 
plug  the  result  into  equation  (4.9)  yielding: 

G "  =  L^L^'g?) 

=  l:g” 


(4.10) 
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Therefore,  if  the  returns  from  the  two  samples  are  generated  by  the  same  k  factors; 
the  factors  from  sample  one  will  be  linear  combinations  of  the  factors  from  sample 
two.  I  can  therefore  use  the  regression  technique  described  above  to  compare  the  two 
sets  of  factors. 

For  each  assumed  factor  structure,  I  run  a  monte-carlo  evaluation  by 
extracting  subsets  of  the  simulated  returns  and  comparing  the  two  sets  of  factors  using 
the  regression  technique  discussed  above.  After  evaluating  the  simulation  data,  I  run 
monte-carlo  evaluations  based  on  the  real  sample  data.  I  use  a  f-test  to  compare  the 
mean  R-squared  values  from  the  real  data  to  the  simulated  data  in  order  to  identify 
those  factors  that  are  pervasive. 

4.2.2  Hypothesis  #2  -  Equality  of  Risk  Premiums 

Numerous  papers  have  compared  risk  premiums  across  different  time  periods. 
Very  few  have  compared  the  risk  premiums  estimated  from  the  two  distinct  samples 
over  the  same  time  period.  Brown  and  Weinstein  (1983)  is  one  of  the  few 
exceptions.  They  compare  the  entire  vector  of  risk  premiums  from  extracted  factors 
(since  individual  f-tests  are  not  valid).  Brown  and  Weinstein  reject  equality  in  every 
case.36  I  use  macroeconomic  variables  as  proxies  for  the  factors  and  can  compare  the 
price  for  an  individual  factor  across  groups. 

The  first  hypothesis  involves  comparing  the  entire  time-series  of  F!  and  F2. 

For  the  second  hypothesis  I  make  use  of  the  fact  that  the  expected  value  of  the  factors 
is  zero,  so  the  value  of  the  time-series  mean  of  F  in  equation  (4.4)  will  approach  the 
expected  value  of  the  risk  premium. 

Although  there  is  a  considerable  amount  of  literature  looking  at  advanced 
techniques  for  estimating  the  risk  premiums  associated  with  macroeconomic  proxies,  I 


36Brown  and  Winstein  suggest  that  the  rejections  might  be  due  to  the  extremely 
large  number  of  daily  observations  on  each  security.  After  making  a  correction  for 
this  they  find  weak  evidence  of  equality. 
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will  follow  the  Fama-MacBeth  technique  used  by  Chen,  Roll,  and  Ross  (1986)  and 
discussed  in  Chapter  3.  This  technique  allows  a  large  number  of  securities  to  be 
analyzed  and  is  still  prevalent  in  the  literature. 

The  methodology  consist  of  two  main  steps.37  In  the  first  step,  I  run  a 
separate  times-series  regression  of  the  following  form  to  estimate  the  betas 
(sensitivities)  for  each  of  i  portfolios: 

ri,t  =  ai+pi,MPMPt+piiURpURPt+pi>wsUTSt+piiUIUIt+piiDEIDEIt+ei,t  (4.11) 
For  the  regression  in  equation  (4.11)  I  use  five  years  of  data  prior  to  the  start  of  the 
sample  period. 

In  the  second  step,  the  estimated  beta  coefficients  are  used  as  independent 
variables  in  cross-sectional  regressions  of  the  following  form 

rijt  =  ^0  +  ^MP$i,MP+^  URP  P  i,  URP +^UTsfii,UTS  +  ^Ulfii,UI+^DEI$i,DEI+ei  (4-12) 

A  separate  cross-sectional  regression  is  run  for  each  month  of  the  year  following  the 
five  year  estimation  period.  This  procedure  is  repeated  for  each  year  in  the  sample 
period,  resulting  in  a  time-series  of  60  observations  for  each  of  the  risk  premiums. 
The  times-series  mean  of  each  of  the  risk  premiums  can  be  tested  by  a  f-test  for 
significant  departures  from  zero.  I  will  also  use  a  Mest  to  compare  the  risk 
premiums  estimated  from  unique  subsets  of  the  data. 

In  order  to  partially  alleviate  the  errors-in-variables  problem  that  arises  by 
using  the  estimated  beta  values  in  the  second-step  regressions,  I  group  the  securities 
into  portfolios  before  running  the  second-step  regressions.  It  is  critical  that  the 
sorting  scheme  provides  a  good  dispersion  in  expected  returns.  If  each  portfolio  has 
similar  expected  returns,  the  cross-sectional  regressions  will  have  little  power  to 
determine  the  prices. 


37The  actual  proxies,  and  their  definitions  are  discussed  in  Chapter  5. 
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I  look  at  two  common  approaches  for  grouping  the  portfolios;  size  and 
industry.  I  want  the  groupings  to  eliminate  idiosyncratic  shocks,  but  maintain  a  wide 
dispersion  in  exposure  to  the  systematic  shocks  (i.e.,  maintain  a  wide  dispersion  in 
beta's).  Groupings  based  on  industries  is  an  obvious  choice  for  eliminating 
idiosyncratic  risks.  Size  is  perhaps  the  most  common  technique  for  grouping  and 
clearly  provides  a  good  dispersion  in  returns. 

Previous  research  has  documented  that  the  price  of  the  factors  is  affected  by 
the  portfolio  grouping  scheme.  This  apparent  violation  of  the  APT  is  probably  due  to 
an  inherent  weakness  in  the  empirical  methodology.  The  portfolios  need  to  be 
formed  in  a  manner  that  disperses  the  securities  with  respect  to  their  sensitivity  to  the 
various  factors.  If  there  is  only  one  factor  (as  in  tests  of  the  CAPM)  this  task  is 
easy.  With  just  one  factor,  sorting  on  expected  returns  will  automatically  disperse  the 
single  beta.  On  the  other  hand,  sorting  on  expected  returns  may  or  may  not  disperse 
all  of  the  betas  in  an  APT  context.  Warga  (1989)  argues  that  different  grouping 
schemes  will  maximize  dispersion  of  assets'  betas  for  some  factors  but  will  yield  little 
dispersion  in  the  assets'  betas  for  other  factors.  Therefore  these  methods  will  give 
precise  estimates  for  some  of  the  risk  premiums  and  imprecise  for  others. 


CHAPTER  5 


DATA  SELECTION  AND  SOURCES 

This  chapter  provides  a  review  of  the  various  data  sources  used  in  this 
dissertation.  Where  necessary  I  discuss  the  selection  criteria  and  compare  my  choices 
to  those  prevalent  in  the  literature. 


5.1  Security  Returns 

I  use  common  stocks  traded  on  the  New  York  Stock  Exchange  (NYSE), 
American  Stock  Exchange  (AMEX),  and  the  Nasdaq  market.  Returns  data  for  the 
securities  are  obtained  from  the  Center  for  Research  in  Security  Prices  (CRSP)  1995 
Stock  File.  I  use  only  continuously-listed  firms  with  no  missing  returns  data,  thereby 
eliminating  any  error  associated  with  estimated  returns.  At  first  glance  one  might 
argue  that  the  use  of  only  continuously-listed  firms  will  bias  the  results;  since  it  is 
possible  that  omitting  a  particular  set  of  firms  will  cause  one  to  omit  relevant  factors. 
However,  all  subsets  will  be  subject  to  the  same  omission  and  therefore  will  still  be 
affected  by  the  same  factors. 

The  APT  is  not  constrained  to  any  particular  return  period,  and  previous 
empirical  studies  use  daily,  weekly,  and  monthly  return  data.  Huang  and  Jo  (1992) 
show  that  the  data  frequency  has  no  effect  on  the  number  of  significant  factors. 
Therefore,  the  choice  of  data  frequency  should  not  affect  the  robustness  of  this  study. 
For  the  first  hypothesis,  factor  equivalence  across  samples,  the  results  will  hinge  on 
the  asymptotic  equivalency  of  a  sample's  T  x  T  product  matrix.  Therefore,  from  an 
estimation  standpoint,  I  prefer  a  large  number  of  time-series  observations. 
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Unfortunately,  the  parameters  being  estimated  (e.g.,  betas  and  risk  premiums)  may 
not  be  stable  over  long  time  periods.  In  fact,  there  is  a  large  body  of  literature 
suggesting  these  parameters  vary  with  time.  Daily  data  would  maximize  the  number 
of  observations  for  a  given  time  period.  Unfortunately,  it  is  well  documented  that 
daily  data  can  bias  estimates  of  variances  (due  to  asynchronous  trading)  and  means 
(due  to  the  bid-ask  spread).  Monthly  data  partially  alleviates  these  problems  but 
restricts  the  number  of  observations. 

Therefore,  I  compromise  and  construct  weekly  returns  from  the  daily  CRSP 
observations.38  Weekly  returns  are  calculated  as  the  compounded  return,  including 
any  dividends,  from  the  last  trading  day  of  the  previous  week  to  the  last  trading  day 
of  the  current  week.  Therefore,  most  weeks  will  have  five  trading  days  but  some 
will  have  less  due  to  market  holidays.  For  the  second  hypothesis,  price  equivalence 
across  samples,  I  use  monthly  returns.  This  is  due  to  the  fact  that  most  of  the 
macroeconomic  variables  are  only  available  on  a  monthly  basis.  Again,  I  construct 
the  monthly  returns  from  the  CRSP  daily  observations.  However,  with  monthly 
returns  I  allow  for  missing  daily  observations. 

The  final  consideration  is  the  length  of  the  estimated  interval;  which,  as 
mentioned  above,  is  a  choice  between  stationarity  and  sample  size.  I  conduct  the 
tests  over  a  ten  year  interval  and,  when  possible,  two  five  year  subintervals.  The 
separate  five  year  periods  are  1986  through  1990  and  1991  through  1995.39  For  each 


38This  is  the  same  approach  used  in  many  studies,  including  those  by  Lehmann 
and  Modest  (1988),  Shukla  and  Trzcinka  (1990),  and  McCulloch  and  Rossi  (1990). 
However,  some  of  these  define  the  week  on  a  Wednesday  to  Tuesday  basis  since 
there  are  fewer  holidays  on  these  days  and  to  alleviate  any  day  of  the  week  effect. 

39One  of  the  simulation  models  requires  that  t  (the  number  of  observations)  be 
greater  than  n  (the  number  of  securities);  in  this  case  I  do  not  look  at  the  five  year 
intervals. 
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five  year  interval  I  retrieve  260  weekly  and  60  monthly  returns.  The  final  number  of 
securities  in  the  various  samples  range  from  1680  to  3308. 

5.2  Macroeconomic  Variables 

The  methodology  requires  the  use  of  excess  returns  (returns  in  excess  of  the 
risk-free  rate).  There  is  some  questions  as  to  the  validity  of  short  term  risk-free  rates 
implied  from  T-bills.40  For  this  reason,  I  construct  weekly  risk-free  rates  using  daily 
observations  of  a  three  month  T-bill  rate.  The  St.  Louis  Federal  Reserve  Bank's 
Economic  Data  Base  (FRED)  provides  a  daily  rate  for  the  shortest  term  T-bill  having 
at  least  three  months  to  maturity.  For  each  week  I  calculate  the  average  of  these 
daily  rates  and  convert  the  average  into  a  weekly  rate. 

For  the  second  hypothesis,  I  use  the  monthly  T-bill  rate  from  Ibbotson 
Associates  (hereafter  referred  to  as  IA).41  IA  use  the  CRSP  Government  Bond  file  to 
find  the  shortest  term  bill  having  at  least  one  month  to  maturity.  They  price  the  bill 
on  the  last  day  of  the  previous  and  current  trading  month  and  calculate  a  total  return 
based  on  these  prices. 

As  previously  discussed,  the  macroeconomic  proxies  will  only  be  used  in 
testing  the  second  hypothesis,  equivalence  of  risk  premiums  across  groups.  I  use  five 
variables  that  are  similar  to  those  used  by  Chen,  Roll,  and  Ross  (1986).  The  CRSP 
equally  weighted  index  is  also  used  (both  simultaneously  with  the  other  variables  and 
alone).  The  five  proxies  are  derived  from  various  basic  series  discussed  in  the 
following  paragraphs. 


40Retums  on  short  term  T-bills  are  highly  volatile  due  to  their  use  in  many 
hedging  strategies.  Many  thanks  to  Dr.  Jim  Hilliard  for  pointing  this  out  and  for  his 
insightful  discussions  regarding  risk-free  rates. 

41  All  of  the  Ibbotson  &  Associate's  data  is  taken  from  "Stocks,  Bonds,  Bills,  and 
Inflation  -  1995  Yearbook, "  published  by  Ibbotson  &  Associates. 
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5.2.1  Basic  Monthly  Series 

1 .  CPI  =  seasonally  adjusted  consumer  price  index  from  the  Bureau  of 
Labor  Statistics.  There  is  some  question  as  to  the  use  of  seasonally 
versus  nonseasonally  adjusted  variables  and  examples  of  both  can  be 
found  in  the  literature.  To  the  extent  that  I  want  unexpected 
movements  the  use  of  seasonally  adjusted  variables  makes  more  sense. 

2.  LGB  =  total  return  on  a  long  term  government  bond  from  IA.  Using 
data  from  the  Wall  Street  Journal,  IA  choose  one  bond  with  a  term  of 
approximately  20  years.  They  choose  a  bond  with  a  current  coupon 
and  without  special  tax  features,  call  privileges,  etc.  The  total  return  is 
based  on  the  flat  price  (average  of  bid  and  ask  prices)  plus  any  accrued 
coupon  interest. 

3.  TB  =  one  month  T-bill  rate  from  IA  (discussed  under  risk  free  rates 
section  above). 

4.  IP  =  seasonally  adjusted  industrial  production  index  from  the  Survey  of 
Current  Business. 

5.2.2  Derived  Monthly  Proxies 

Based  on  the  above  basic  series  I  construct  five  proxies.  I  use  the  same  names 
as  Chen,  Roll,  and  Ross  (1986)  but  some  of  the  derivations  are  slightly  different. 

The  proxies  are  discussed  in  the  following  paragraphs: 

1 .  MP  =  monthly  growth  in  industrial  production.  MP(t)  =  ln[IP(t)/ 
IP(M)].  As  discussed  by  Chen,  Roll,  and  Ross,  IPfr)  is  a 
measurement  that  lags  actual  activity  by  at  least  part  of  a  month,  so 
this  variable  will  lead  the  other  factors  by  one  month. 

2.  URP  =  a  measure  of  the  risk  premium  associated  with  default  spreads. 
URP(t)  =  LCB(r)  -  LGB(t).  Chen,  Roll,  and  Ross  use  the  return  on 
low-grade  corporate  bonds  obtained  from  IA.  This  variable  was  not 
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available  and  thus  I  use  a  slightly  different  variable.  A  large  portion  of 
the  return  on  junk  bonds  can  be  thought  of  as  a  call  option  on  the 
underlying  equity  of  the  firm.  Therefore,  URP  is  somewhat  related  to 
a  market  index.  This  might  partially  explain  the  high  significance  of 
URP  in  Chen,  Roll,  and  Ross's  results. 

3.  UTS  =  a  measure  of  the  risk  associated  with  maturity  spreads.  UTS (7) 
=  LGB(0  -  TB(f). 

4.  UI  =  a  measure  of  unanticipated  inflation.  UI(f)  =  1(f)  -  E[I(f)/I(M)]. 
Actual  inflation  is  defined  as  the  monthly  difference  in  the  logarithm  of 
CPI  (i.e. ,  1(f)  =  ln[CPI(f)/CPI(f-l)]).  Next,  I  define  expected  inflation, 
E[I(f)/I(f-l)],  as  the  fitted  values  from  an  ARIMA(  1,1,1)  time-series 
model  for  1(f).  Then,  UI(f)  is  defined  as  the  residual  or  unexpected 
component  of  the  1(f)  time-series.  A  review  of  the  literature  shows 
numerous  methods  for  constructing  the  expected  inflation  series. 

Chen,  Roll,  and  Ross  use  the  results  of  Fama  and  Gibbons  (1984)  who 
back  out  expected  inflation  from  Fisher's  Equation.42  Others  have  used 
Kalman  filtering  techniques  or  various  ARIMA  specifications. 

5.  DEI  =  change  in  expected  inflation.  DEI(f)  =  E[I(f+ 1)/I(f)]  - 
E[I(f)/I(f-l)] . 

As  mentioned  in  Chapter  3,  there  are  numerous  method  for  selecting  and 
calculating  the  proxies.  These  five  are  chosen  as  a  starting  point  based  on  their 
prevalence  in  the  literature. 


42The  Fisher  equation  relates  risk  free  rates  to  expected  real  returns  and  expected 
inflation. 
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5.2.3  Properties  of  the  Macroeconomic  Proxies 

The  macroeconomic  proxies  are  assumed  to  be,  or  at  least  approximate,  mean- 
zero  innovations  or  unexpected  movement.  As  such,  the  time-series  of  these 
variables  should  display  little  autocorrelation  and  their  means  should  be  close  to  zero. 
Table  5-1  lists  the  mean,  variance,  and  autocorrelations  for  each  of  the 
macroeconomic  variables.  With  the  possible  exception  of  the  MP  series,  none  of  the 
autocorrelation  functions  appear  to  be  large  and  thus  the  series  can  be  used  as 
innovations.  The  mean  values  are  all  near  zero  and  therefore  I  do  not  attempt  to 
subtract  off  a  running  mean. 

Table  5-2  shows  the  correlations  across  the  proxies.  URP  and  UTS  are  highly 
correlated  since  they  both  have  LGB  in  their  definition.  I  define  a  second  risk 
premium  (URP-2)  to  try  and  lessen  the  collinearity  between  URP  and  UTS.  First,  I 
calculate  the  monthly  yield  differential  between  a  portfolio  of  Baa  rated  commercial 
bonds  and  a  portfolio  of  long-term  government  bonds  (both  given  in  Moody’s  annual 
bond  manuals).  URP-2  is  then  the  first  difference  of  the  natural  log  of  the  yield 
differential  series.  Since  yields  are  inversely  correlated  with  returns  URP-2  should  be 
negatively  correlated  with  URP.  Unfortunately  URP-2  is  still  highly  correlated  with 
UTS. 

Unlike  some  researchers,  I  do  not  find  a  high  correlation  between  UI  and 
DEI,  suggesting  that  for  this  period  of  data  the  ARIMA(1,1,1)  model  does  an 
adequate  job  of  filtering  out  the  unanticipated  component  of  inflation.  As  Table  5-2 
illustrates  all  of  the  variables  are  somewhat  correlated  and  this  should  be  kept  in  mind 
when  interpreting  the  results.  Koutoulas  and  Kryzanowski  (1996)  orthogonalize  the 
factors  by  successively  regressing  them  on  each  other.  This  approach  eliminates  the 
collinearity  but  involves  arbitrary  choices  as  to  the  order  of  orthogonalization. 
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5.3  Summary 

This  concludes  the  discussion  on  the  various  data  sources  and  proxies  used  in 
this  research.  A  word  of  caution  is  necessary  regarding  the  macroeconomic  proxies. 
There  are  numerous  choices  being  made  when  using  these  factors.  These  choices 
involve  the  specification  of  the  innovations,  the  alignment  of  the  proxies  with  the 
returns,  collinearity  issues,  etc.  I  have  lightly  touched  on  some  of  these  choices  in 
the  preceding  sections  and  in  Chapter  3.  Previous  papers  have  shown  that  the 
significance  of  factors  is  very  sensitive  to  these  and  other  issues  and  as  such  the 
results  presented  in  the  next  chapter  apply  only  to  the  proxies  as  I  have  defined  them. 
Clearly  much  work  remains  to  be  done  in  this  area. 


CHAPTER  6 


RESULTS 

This  chapter  presents  the  empirical  analysis.  The  first  section  looks  at 
hypothesis  #1  (equivalence  of  factors)  and  the  second  section  looks  at  hypothesis  #2 
(equality  of  prices).  When  possible  the  results  are  compared  to  those  found  in  the 
literature. 


6.1  Hypothesis  #1  -  Equivalence  of  Factors 
Tests  of  the  first  hypothesis  involve  comparing  real  world  data  to  simulated 
data.43  Before  presenting  the  results,  it  is  necessary  to  discuss  the  various  simulation 
models  used  in  this  analysis.  All  of  the  models  assume  a  five  factor  generating 
process.  The  key  differences  are  in  the  assumed  factor  structure  (i.e.,  strict  versus 
approximate)  and  in  how  the  factors,  betas,  and  idiosyncratic  terms  are  generated.  In 
general,  I  need  to  generate  a  matrix  of  excess  asset  returns  that  follow  the  linear 
factor  structure  of  equation  (4.4).  The  following  section  discusses  how  the  five 
models  accomplish  this  task.  Table  6-1  provides  a  summary  of  the  techniques. 

6.1.1  Simulation  Models 

The  first  model  is  based  on  the  simulation  used  in  Connor  and  Korajczyk 
(1988).  Five  factors  are  extracted  from  the  full  sample  of  security  returns.  These  are 
assumed  to  be  the  "true"  factors.  The  "true"  betas  are  obtained  by  ordinary  least 


43Throughout  this  chapter  the  term  "real  world  data"  means  actual  returns  from 
NYSE/AMEX/Nasdaq  securities  and  "simulated  data"  means  returns  generated  from 
the  models  described  in  the  following  section. 
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squares  (OLS)  regressions  of  the  excess  security  returns  on  the  extracted  factors.  The 
idiosyncratic  terms  are  assumed  to  be  temporally  independent  but  cross-sectionally 

«2 

correlated.  Specifically,  let  °«  be  the  variance  of  the  residuals  from  the  above  OLS 
regressions.  The  idiosyncratic  return  for  asset  i  is 
ei  =  ’ll 

6;  =  Pe«-i  +  Tli>  *  =  2’  •**»  n  (6.1) 


where  r|j  is  a  random  draw  from  a  normal  distribution  with  mean  zero  and  variance 
s2  -  where  s2  is  set  so  that  var(Ej)  =  £2. 

If  p  =  0  the  simulated  returns  follow  a  strict  factor  structure.  If  0  <  p  <  1, 
then  every  securities  idiosyncratic  term  is  correlated  with  one  another  (cross- 
sectionally).  The  BnF  matrix  is  constant  for  every  iteration,  only  the  en  matrix  is 
simulated. 

Since  the  simulated  factors  are  based  on  the  extracted  factors  they  will  not  be 
equally-important  factors.  Figures  6.1  through  6.3  show  a  plot  of  the  eigenvalues  for 
the  various  samples.  These  figures  clearly  show  that  the  first  one  or  two  factors  are 
dominant  and  the  others  may  not  be  significant.  Of  course  the  extracted  factors  may 
have  loaded  heavily  on  the  first  eigenvector  -  if  the  other  factors  are  pervasive 
however,  the  regression  technique  described  in  Chapter  4  should  still  work. 

The  second  model  is  based  on  a  model  in  Connor  and  Korajczyk  (1993).  This 
model  is  different  from  the  first  in  several  ways.  First,  it  assumes  equally-important 
factors,  and  second  it  allows  for  the  factors  and  betas  to  be  random  draws  and  thus 
vary  with  each  simulated  draw.  Following  the  development  in  Connor  and  Korajczyk 
(1993),  the  model  of  excess  returns  is: 
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ru =  Bift  +  eu  1  = !» •••»  »;  *  =  i,  t 

Bt  ~  MVN  (l5,o2bI5) 

/,  -  MIW  (u/5,  a;  75) 

ef  ~  MVN  (0",  o]ln)  (6  2) 

where  l5  is  a  5  x  1  vector  of  ones,  e‘  is  a  n  x  1  vector  of  idiosyncratic  returns  for 
period  t  and  0"  is  an  n  x  1  vector  of  zeros.  As  seen  in  equation  (6.2)  the  betas  are 
i.i.d.  with  mean  1  and  variance  ab2,  the  factors  are  i.i.d.  with  mean  risk-premium  n 
and  variance  af2,  and  the  idiosyncratic  terms  are  i.i.d  with  mean  zero  and  variance 
<Je2- 

There  are  four  parameters  in  the  model,  crb2,  af2,  ae2,  and  n.  Connor  and 
Korajczyk  suggest  the  following  scheme  for  setting  the  parameters.  <re2  is  given  by 
the  cross-sectional  average  mean-squared  residual  after  extracting  5  factors  and 
running  an  OLS  regression  as  in  the  first  model.44  The  remaining  three  parameters 
are  tied  to  the  average  excess  return  to  the  CRSP  equally  weighted  index  (E  fcw]), 
the  variance  of  the  equally  weighted  index  (cs2^)  and  the  variance  of  the  average  asset 
in  the  sample  (a2).  Details  of  the  procedure  are  found  in  Connor  and  Korajczyk 
(1993). 

In  summary  then,  the  second  model  uses  simulated  values  for  B",  F,  and  e". 
The  model  uses  sample  and  market  observations  to  set  the  parameters  and  each  of  the 
factors  play  an  equal  role  in  generating  returns.  The  second  model  assumes  a  strict 
factor  structure. 

The  third  model  is  identical  to  the  second  model  in  every  way  but  one;  the 
third  model  sets  the  simulation  parameters  using  only  sample  data.  Specifically,  I 
construct  an  equally  weighted  portfolio  of  the  securities  in  my  sample  and  use  this 


44 Actually  Connor  and  Korajczyk  only  use  four  factors  in  their  model.  Their 
methodology  was  adjusted  accordingly  for  my  five  factor  model. 
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portfolio's  mean  and  variance  in  place  of  the  CRSP  equally  weighted  index's  mean 
and  variance.  This  distinction  may  be  important  since  the  CRSP  equally  weighted 
index  has  almost  twice  the  expected  return  of  my  sample's  index.45  The  ability  to 
extract  factors  will  depend  on  the  relative  proportion  of  the  return  that  is  explainable 
(i.e.  BnF)  versus  the  idiosyncratic  portion  (e11).  Using  sample  data  to  set  en  and 
market  data  to  set  BnF  will  bias  the  method  towards  being  able  to  successfully  extract 
the  factors  if  the  market  index  has  a  higher  expected  return  than  the  sample. 

The  fourth  model  makes  one  modification  to  the  third  model.  Instead  of 
equally  important  factors,  I  scale  the  betas  so  that  each  factor  is  responsible  for 
varying  degrees  of  the  expected  component  of  returns.  Specifically,  I  simulate  the 
factors,  betas,  and  residuals  as  in  the  third  model  but  then  I  multiply  B,  by  4,  B2  by 
2,  B3  by  1,  B4  by  0.5  and  B5  by  0.25.  This  scaling  preserves  the  proportion  of 
returns  that  is  explainable  but  it  shifts  the  amount  attributable  to  each  individual 
factor. 

The  fifth  model  is  perhaps  the  most  important  from  a  theoretical  standpoint. 
The  first  model  allows  for  either  a  strict  factor  model  or  a  factor  model  where  every 
securities  idiosyncratic  terms  are  correlated.  The  second  through  fourth  models  all 
assume  a  strict  factor  structure.  The  asymptotic  principal  components  technique, 
however,  is  based  on  an  approximate  factor  structure.  The  fifth  model  generates  BP 
and  F  the  same  as  in  the  third  model  but  then  models  the  idosyncratic  portion  as  a 
random  draw  from  a  multivariate  normal  distribution  with  mean  (?  and  a  variance- 
covariance  matrix  based  on  the  actual  residuals  from  the  OLS  regressions  (the  other 
techniques  used  only  the  diagonal  elements  of  this  matrix).  I  use  the  IMSL  routine 


45This  difference  is  probably  driven  by  the  exclusion  of  newly  listed  firms,  firms 
involved  in  mergers,  and  thinly  traded  firms  -  all  of  which  generally  outperform  the 
market. 
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RNMVN  to  generate  the  random  draw  of  e".  This  routine  uses  a  Cholesky 
decomposition  of  the  sample  covariance  matrix  to  generate  the  multinormal  variates. 

In  order  to  estimate  the  n  x  n  covariance  matrix  for  the  sample,  n  (the  number 
of  securities)  has  to  be  less  than  T  (the  number  of  observations)  so  I  use  only  the  full 
10  year  period  (not  5  year  intervals)  when  using  this  model. 

6.1.2  Comparing  Factors  Extracted  From  Real  Data 

The  first  results  compare  the  factors  extracted  from  mutually  exclusive 
portfolios  of  actual  security  returns.  I  randomly  choose  two  portfolios  of  n  securities 
(where  n  is  equal  to  50,  150,  500,  and  1500)  and  extract  seven  factors  from  each 
portfolio.46  I  then  regress  the  factors  from  the  first  sample  (one  at  a  time)  on  all 
seven  of  the  factors  extracted  from  the  second  sample.  I  repeat  this  procedure  100 
times  and  then  calculate  the  mean  and  standard  deviation  of  the  R2  values  for  each  of 
the  seven  factors.  These  results,  along  with  the  maximum  and  minimum  for  each  R2 
value,  are  provided  in  Table  6-2. 

I  could  also  have  regressed  the  factors  from  the  second  sample  on  the  factors 
from  the  first  sample  -  since  I  do  not  know  which  set  of  factors  is  the  "true"  set.  If 
the  factors  were  identical  the  R2  values  would  be  the  same  in  either  case.  This 
technique  would  have  doubled  my  sample  size  but  may  have  introduced  a  bias  of  an 
unknown  direction  and  magnitude.  The  choice  of  seven  for  the  number  of  factors  is 
somewhat  arbitrary.  A  survey  of  the  current  literature  reveals  a  somewhat  polar 
distribution  -  some  feel  there  are  only  one  or  two  factors  while  others  argue  there  are 
a  very  large  number.  There  is  a  tradeoff  between  extracting  too  few  or  too  many 
factors.  I  would  like  to  extract  all  of  the  pervasive  factors  and  one  or  two  non- 
pervasive  or  "false"  factors.  This  would  allow  for  a  robust  test  of  the  technique  since 
the  pervasive  factors  should  have  R2  values  near  one  while  the  "false"  factors  will 

46For  the  overall  10  year  period  I  use  n  —  1000  instead  of  1500  (since  there  are 
only  2334  securities  in  this  sample). 
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not.  Unfortunately  extracting  too  many  factors  is  computationally  burdensome  and 
might  bias  the  results  (since  each  additional  factor  acts  as  a  linearly  independent 
regressor). 

Several  conclusions  are  apparent  from  Table  6-2.  First,  the  results  are  very 
dependent  on  both  sample  size  and  sample  period.  Just  as  one  would  expect,  as  the 
sample  size  increases  so  do  some  of  the  R2  values.  The  primary  explanation  is  that 
the  "non-pervasive"  factors  become  less  prevalent  as  the  idiosyncratic  risk  is 
diversified  away.  With  small  samples,  one  security  might  experience  a  stream  of 
large  idiosyncratic  shocks.  It  is  possible  that  one  of  the  extracted  factors  will  be 
perfectly  (or  near  perfectly)  correlated  with  a  particular  securities  time-series  of 
returns.  Some  researchers  have  thrown  out  these  so  called  "Hey wood"  cases. 
Throwing  out  spurious  returns  is  dangerous  -  how  does  one  judge  what  is  or  is  not 
"normal"  in  the  market? 

Table  6-2  clearly  illustrates  the  pitfalls  associated  with  trying  to  identify 
factors  with  very  small  portfolios.  The  R2  values  are  extremely  small  and  there  is  no 
clear  break-point  in  the  values  between  the  factors.  In  the  1986  through  1990  period 
the  results  seem  to  indicate  a  one  factor  model,  as  only  the  first  R2  value  approaches 
one  as  the  sample  size  is  increased.  In  the  overall  ten  year  period,  and  to  some 
extent  in  the  1991  through  1995  period,  the  R2  values  for  the  first  two  factors  seem  to 
dominate  the  others.  These  results  are  consistent  with  the  eigenvalue  plots  in  Figures 
6-1  through  6-3. 

t 

One  possible  explanation  for  the  results  in  Table  6-2  is  that  the  extraction 
technique  loads  up  the  significance  of  the  first  factor.  This  is  a  potential  problem  and 
the  following  section  addresses  it,  and  other  issues,  in  more  detail. 

6.1.3  Validation  of  Techniques 

The  data  presented  in  the  previous  section  was  suggestive  of  only  one  or  two 
"pervasive"  factors.  By  looking  at  the  R2  values  obtained  with  simulated  data  I  will 
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be  able  to  make  statistical  inferences  about  the  equality  of  the  real  world  and 
simulated  factor  structure.  These  inferences  are  only  valid  if  the  asymptotic  principal 
components  technique  can  extract  the  factors  with  little  error  and  if  the  regression 
technique  for  comparing  factors  is  able  to  distinguish  between  pervasive  and  "false" 
factors.  The  results  in  this  section  suggest  that  both  of  the  techniques  (factor 
extraction  and  comparison)  are  robust.47  I  also  look  at  the  sensitivity  of  the 
techniques  to  the  sample  size. 

With  the  simulated  returns  data  I  have  the  luxury  of  knowing  the  "true" 
factors.  I  can  therefore  compare  the  extracted  factors  to  the  true  factors.  For  each  of 
my  five  models  I  generate  the  same  number  of  security  returns  as  there  are  securities 
in  the  various  time  periods  (e.g.,  for  1991  through  1995  I  generate  a  time-series  of 
260  excess  return  observations  for  3,673  securities).  I  then  extract  seven  factors 
from  the  simulated  returns  and  regress  them  on  the  full  set  of  true  factors.  I  run  25 
iterations  for  each  of  the  models.48  Table  6-3  provides  the  results  of  this  analysis. 

Table  6-3  clearly  demonstrates  the  ability  of  both  of  the  techniques.  For  the 
most  part,  in  every  time  period  and  across  the  models,  the  R2  values  of  the  first  five 
factors  approach  one  and  the  two  "false"  factors  have  very  low  (almost  zero)  R2 
values.  For  the  assumed  model  structures  and  sample  sizes  the  error  term  in  equation 
(4.7)  appears  to  be  negligible.  The  results  also  indicate  that  even  if  the  extraction 
technique  loads  up  the  significance  of  the  first  factor  the  regression  method  is  still 
able  to  identify  the  other  pervasive  factors.  This  important  result  should  dispel  any 


47Obviously  the  two  techniques  are  inseparable  -  if  the  factor  regression  technique 
shows  the  factors  are  not  the  same  it  could  be  a  violation  of  either  the  factor 
extraction  technique  or  the  factor  regression  technique. 

48Since  I  generate  returns  for  the  full  sample,  the  simulations  and  factor  extraction 
are  computationally  burdensome.  Relative  to  the  real  world  data,  the  standard 
deviation  of  the  R2  values  are  very  small  so  I  feel  comfortable  with  only  25  iterations 
for  these  results. 
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fears  that  the  rotational  indeterminancy  problem  will  make  the  task  of  factor 
comparison  impossible.  The  following  paragraphs  highlight  some  other  aspects  of 
Table  6-3. 

It  is  apparent  (from  Panel  A  in  Table  6-3)  that  until  the  correlation  coefficient 
becomes  very  large  (e.g.,  0.9)  the  results  are  insensitive  to  the  value  of  rho. 
Subsequent  work  therefore  only  looks  at  the  case  where  rho  is  equal  to  0.5.  The 
differences  in  the  results  from  the  second  and  third  model  (Panels  B  and  C 
respectively  in  Table  6-3)  are  also  virtually  non-existent.  Therefore,  I  only  use  the 
third  model  in  subsequent  work.  The  fourth  model  (Panel  D)  shows  that  scaling  the 
relative  importance  of  the  betas  does  impact  the  techniques  ability  to  extract  them. 
However,  only  the  ability  to  extract  the  fifth  factor  which  was  scaled  by  0.25  seems 
to  suffer  (and  even  then  it's  R2  value  is  significantly  larger  than  the  sixth  and  seventh 
extracted  factors). 

Panel  E  indicates  that  the  technique  is  not  as  efficient  with  data  from  the  fifth 
model.  At  first,  this  seems  unfortunate  since  this  is  the  only  model  that  generates 
returns  following  an  approximate  factor  structure.  There  is  one  very  plausible 
explanation  for  this  result.  The  sample  size  for  the  other  models  ranged  from  2,334 
securities  to  3,367  securities,  whereas  the  fifth  model  is  limited  to  500  securities.  As 
indicated  in  Table  6-2  size  seems  to  be  a  key  issue  and  model  five's  limited  sample 
size  may  bias  the  results. 

The  next  analysis  further  investigates  the  issue  of  size  by  looking  at  factors 
extracted  from  smaller  portfolios  of  simulated  returns.  Specifically,  I  extract  factors 
from  simulated  returns  using  portfolios  of  50,  150,  500,  and  1500  securities  and  then 
I  regress  the  extracted  factors  on  the  true  factors.  Tables  6-4  through  6-7  provide  the 
results.  Again,  there  is  overwhelming  evidence  that  size  is  critical  when  comparing 
factors.  When  the  sample  size  is  50  (Table  6-4)  the  results  seem  to  indicate  only  one 
factor,  even  though  the  returns  are  generated  by  five  factors.  This  offers  much 
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insight  into  the  mixed  results  of  previous  studies  using  portfolios  of  only  30 
securities.49 

Tables  6-4  through  6-9  highlight  one  of  the  key  differences  between  the 
models.  The  first  and  fifth  model  allow  each  security  to  have  a  different  mean  value 
for  it's  simulated  idiosyncratic  return.  Models  three  and  four  assume  the 
idiosyncratic  shocks  are  identically  distributed.  With  unequal  idiosyncratic  variances 
(as  one  would  expect  with  real  data),  sample  size  is  more  important  because  it  is 
more  likely  that  one  random  security  might  have  unusually  large  idiosyncratic  shocks 
that  are  picked  out  as  a  "false"  pervasive  factor.  By  comparing  the  results  in  Panel  A 
of  Table  6-6  (model  one  with  500  securities)  with  the  previously  discussed  results  for 
model  five  in  Panel  D  of  Table  6-2  (which  also  used  only  500  securities)  we  see  that 
the  portfolio  size  probably  played  a  large  role  in  the  somewhat  weak  results  for 
model  five. 

With  a  sample  size  of  1,500  as  in  Table  6-7  the  results  are  encouraging,  but 
somewhat  mixed.  With  model  three's  equally  explanatory  factors  (and  perhaps  more 
importantly)  equal  idiosyncratic  variances  the  technique  is  near  perfect  (as  seen  in 
Panel  B).  The  R2  values  for  the  first  five  factors  are  almost  one  and  the  E2  values  for 
the  "false  factors"  are  no  greater  than  0.001'.  Even  with  the  scaled  factor's  of  model 
four  the  technique  is  fairly  robust.50 

6.1.4  Statistical  Inferences 

Now  that  I  have  demonstrated  the  ability  of  the  techniques  I  can  compare  the 
R2  values  from  the  sample  data  to  the  R2  values  from  the  simulated  data  and  make 


49With  small  portfolios,  factor  analysis  may  outperform  principal  components 
since  factor  analysis  attempts  to  explain  only  common  variation  whereas  principal 
components  explains  total  variation. 

50It  is  important  to  realize  that  the  first  factor  now  has  16  times  the  explanatory 
power  of  the  fifth  factor  and  as  such  the  fifth  factory  may  be  virtually  insignificant. 
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inferences  regarding  their  equality.  Specifically,  if  the  actual  returns  are  generated  by 
a  ^-factor  model  that  is  of  the  same  form  as  one  of  the  five  tested  models,  the  R 
values  should  be  similar.  Tables  6-8  through  6-11  present  the  results  for  sample  sizes 
of  50,  150,  500,  and  1500  securities. 

I  base  the  comparisons  on  one  parametric  test  and  one  nonparametric  test. 

The  parametric  test  is  Satterthwaite's  approximate  test  of  equality  in  means  for  two 
independent,  normally  distributed  samples  with  unequal  variances.  The 
nonparametric  test  is  the  Wilcoxon  rank  sum  test  for  equality  in  means.  The 
Wilcoxon  test  assumes  the  samples  are  independent  and  from  the  same  distribution 
(but  the  distribution  need  not  be  normal).  I  use  the  nonparametric  test  because  a 
Shapiro-Wilk  test  indicates  that  the  R2  values  are  not  from  a  normal  distribution. 

Table  6-8  shows  that  for  a  sample  size  of  50  the  hypothesis  of  equal  means  is 
rejected  for  almost  every  factor  and  every  model  with  both  of  the  test  statistics.  The 
one  possible  exception  is  the  results  for  the  first  model  in  Panel  B.  This  is 
encouraging  considering  that  the  "true"  factors  in  the  first  model  are  extracted  from 
the  real  world  data  that  they  are  now  being  compared  to.  In  fact,  the  first  model  is 
almost  like  a  control  sample.  Given  the  previous  results  on  the  importance  of  sample 
size,  little  additional  information  can  be  gleaned  from  these  small  sample  results  in 
Table  6-8. 

As  the  sample  size  increases,  it  is  obvious  that  the  real  returns  were  not 
generated  by  a  model  with  five  equally  important  factors  and  equal  idiosyncratic 
variances.  The  r-statistics  for  comparing  model  three  and  four  to  the  real  world 
model  for  some  factors  are  in  excess  of  100  -  resulting  in  overwhelming  rejections  of 
the  null  hypothesis  that  the  mean  R2  values  are  the  same. 

The  results  from  model  one  offer  the  most  insight  as  to  the  true  nature  of  the 
factor  structure  of  the  real  world  data.  The  factors  extracted  from  the  real  world  data 
are  used  as  the  "true"  factors  in  the  first  model.  Looking  at  Panel  B  in  Table  6-11, 
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an  interesting  pattern  emerges  for  the  R2  values  of  the  simulated  data  versus  those 
from  the  real  world  data  in  Panel  A.  First,  in  this  case  (and  in  several  others),  I 
notice  that  the  results  are  very  sensitive  to  the  time  period.  The  1986-1990  time 
period  consistently  has  higher  f-statistics.  This  might  be  due  in  part  to  the  October, 
1987  market  crash.  Looking  at  the  other  two  periods,  there  is  some  evidence  that  the 
first  factor's  R2  values  are  the  same  but  I  reject  the  null  that  factor's  two  through  five 
are  the  same.51  These  results  strongly  suggest  that  the  real  world  data  is  not 
generated  by  a  five  factor  model.  In  fact  the  data  suggests  there  are  at  most  two 
pervasive  factors.  This  is  perhaps  one  of  the  most  significant  results. 

Overall  the  results  are  encouraging.  I  have  not  identified,  nor  do  I  attempt  to 
identify,  the  actual  factor  structure  generating  the  real  security  returns.  I  have, 
however,  provided  a  screening  test  for  evaluating  any  proposed  factor  structure. 

After  comparing  the  real  world  data  to  several  alternative  factor  structures  I  conclude 
that  none  of  the  proposed  structures  produce  returns  consistent  with  the  observable 
results.  I  also  conclude  that  the  number  of  pervasive  factors  in  the  economy,  at  least 
for  the  time  periods  in  my  study,  appears  to  be  less  than  five  and  more  likely  one  or 
two. 


6.2  Hypothesis  #2  -  Risk  Premiums 
This  section  presents  the  results  of  my  second  hypothesis  which  states  the 
price  of  factors  should  be  the  same  across  samples.  As  previously  mentioned, 
macroeconomic  proxies  offer  an  intuitive  appeal  to  the  APT  by  giving  a  meaning  to 
the  unknown  factors.  Unfortunately,  as  past  research  has  documented,  the  number 
and  nature  of  the  priced  factors  is  sensitive  to  numerous  issues  (e.g.,  how  the  data  is 
grouped  into  portfolios,  how  the  macroeconomic  variables  are  aligned  with  the 

51For  the  1986-1995  period  there  is  weak  evidence  that  factors  two  and  three  have 
the  same  R2  values. 
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returns  data,  how  the  innovations  are  specified,  what  time  period  is  studied,  etc.).  In 
short,  the  cost  of  the  macroeconomic  proxies  intuitive  appeal  is  the  spurious  nature  of 
pricing  tests  that  use  them. 

My  results  suggest  a  new  technique  for  evaluating  the  various  proxies  than  can 
complement  normal  tests  for  significance.  I  start  with  an  overall  sample  of  1,680 
securities  and  employ  the  same  methodology  as  used  by  Chen,  Roll,  and  Ross  (1986) 
and  discussed  in  Chapter  4.  I  estimate  a  time-series  of  120  observations  (ten  years  of 
monthly  data)  for  each  of  the  risk  premiums  based  upon  numerous  models  -  the 
models  differ  only  in  the  number  of  macroeconomic  proxies  used.  I  then  divide  the 
sample  in  half  (so  that  there  are  840  securities  in  each  sample)  and  repeat  the 
procedure.  Results  for  several  of  the  models  are  provided. 

6.2.1  Sorting  Schemes 

A  portfolio  grouping  of  returns  is  used  to  partially  eliminate  the  errors-in- 
variables  problem  associated  with  the  use  of  estimated  betas  in  the  cross-sectional 
regressions.  The  portfolios  will  eliminate  much  of  the  idiosyncratic  risks  and  should 
therefore  provide  cleaner  estimates  of  the  prices.  As  mentioned  earlier,  the  grouping 
scheme  is  of  critical  importance  and  several  researchers  have  shown  that  the  number 
and  nature  of  the  priced  factors  is  dependent  on  the  grouping  scheme.  The  groupings 
need  to  eliminate  the  idiosyncratic  risk  and  provide  a  good  dispersion  of  estimated 
betas  for  all  of  the  factors.  Unfortunately,  different  schemes  will  tend  to  offer 
varying  degrees  of  dispersion  for  the  different  factors  and  can  thus  bias  the  results.  If 
there  is  little  cross-sectional  variation  the  OLS  regressions  will  have  a  hard  time 
accurately  predicting  the  price. 

I  choose  two  grouping  schemes.  The  first  variable  I  group  on  is  size. 
Specifically,  I  use  the  prior  year's  end-of-year  market  capitalization  as  provided  by 
CRSP.  I  rank  the  securities  according  to  size  and  then  form  112  portfolios  of  15 
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securities  for  the  overall  sample  and  56  portfolios  of  15  securities  for  the  sub¬ 
samples. 

I  then  repeat  the  entire  analysis  grouping  by  industry.  I  looked  at  several 
schemes  for  grouping  by  industry.  One  obvious  choice  is  to  use  2  digit  SIC  values. 
Unfortunately  this  did  not  provide  a  large  enough  number  of  portfolios  and  the 
number  of  securities  in  each  portfolio  was  bi-modal  -  roughly  half  of  the  industries 
contained  over  100  securities  and  the  other  half  contained  very  few  securities. 

Ideally,  I  would  like  a  large  number  of  industries  (so  that  I  have  enough  data  points 
for  the  cross-sectional  regressions)  and  a  large  number  of  securities  in  each  portfolio 
(to  effectively  diversify  away  the  idiosyncratic  risk).  U.S.  Industrial  Outlook 
provides  a  breakdown  of  over  100  industries  based  on  4  digit  SIC,  but  I  felt  with  that 
many  industries,  the  number  of  securities  in  each  portfolio  would  be  too  small.  I 
compromised  between  the  two  techniques  and  used  a  breakdown  of  48  industries 
provided  by  Fama  and  French  (1997).  Appendix  C  provides  a  list  of  the  industries, 
the  SIC  codes  they  contain,  and  the  number  of  securities  from  my  sample  in  each  of 
the  industries.  The  number  in  each  portfolio  can  vary  and  ranges  from  4  to  153. 

Over  one-half  of  the  industries  have  at  least  25  securities  and  the  average  number  in 
each  portfolio  is  35.  For  the  sub-samples,  I  randomly  divide  the  securities  in  each  of 
the  industries. 

Table  6-12  provides  some  measures  of  how  effective  the  two  sorting  schemes 
are  at  dispersing  the  betas  for  each  of  the  proxies.  The  various  panels  in  Table  6-12 
differ  in  the  number  of  proxies  included  in  the  model.  I  would  like  the  standard 
deviation  for  the  betas  to  be  large.  As  suggested,  the  sorting  schemes  tend  to 
disperse  some  betas  more  so  than  others.  For  example,  in  Panel  A  (which  uses  all 
six  of  the  proxies  as  defined  in  Chapter  5)  I  notice  that  the  standard  deviation  of  the 
betas  for  MP,  EW,  and  UTS  are  small  relative  to  those  for  UI,  DEI,  and  URP.  The 
level  of  dispersion  is  also  affected  by  which  variables  are  in  the  model  as  evidenced 
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by  the  remaining  panels.  There  does  not  appear  to  be  any  large  systematic 
differences  in  dispersion  between  the  two  sorting  schemes. 

6.2.2  Statistical  Inferences 

The  second  hypothesis  involves  comparing  the  prices  of  risk  estimated  from 
two  different  samples.  Before  doing  so,  it  is  informative  to  look  at  the  estimated 
price  of  risk  using  the  entire  sample.  As  documented  by  previous  researchers,  I  find 
that  the  estimated  prices  and  their  significance  are  very  unstable  across  time-periods 
and  models.  Across  time  periods  the  variation  is  attributed  to  the  different  prevailing 

f 

economic  conditions.  The  variation  across  models  is  due  to  the  correlation  between 
the  individual  proxies.  As  different  proxies  are  added  or  subtracted  from  the  model, 
the  prices  can  drastically  change.  This  is  an  unfortunate,  but  unavoidable,  problem.52 

Table  6-13  shows  the  estimated  price  of  the  various  risk  factors  for  the  ten  and 
five  year  periods.  Panel  A  is  for  the  model  with  all  of  the  variables,  Panel  B 
eliminates  EW,  Panel  C  eliminates  EW  and  UTS,  and  Panel  D  eliminates  EW  and 
URP.  I  eliminate  the  market  index  since  it  should  have  no  special  role  in  the  APT 
and  I  eliminate  the  risk  premium  and  term  structure  variables  (one  at  a  time)  because 
of  their  high  correlation  with  each  other. 

There  are  several  results  worth  highlighting.  First,  across  all  of  the  models 
and  time  periods,  most  of  the  factors  are  insignificantly  priced  and  the  intercept 
(which  should  be  zero)  is  significantly  priced.  The  few  exceptions  include  UI,  URP, 
and  UTS  which  are  priced  in  at  least  one  case.  Chen,  Roll,  and  Ross  (1986)  found 
no  significantly  priced  factors  in  their  1978-1984  time  period  (which  is  the  closest 
time-period  to  my  study).  I  also  ran  a  model  with  just  the  equally  weighted  index 


52Some  argue  this  problem  can  be  avoided  by  orthogonalizing  the  factors  using 
progressive  OLS  regressions  (e.g.,  regress  factor  two  on  factor  one  and  use  the 
residuals  as  the  new  factor  two,  then  regress  factor  three  on  factors  one  and  two, 
etc.).  The  problem  is  still  there  since  the  prices  will  depend  on  the  order  of 
orthogonalization. 
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and  find  that  the  price  is  negative  but  insignificant.  This  result  supports  Fama  and 
French  (1992)  who  find  similar  results  using  data  from  recent  time  periods.  When  I 
break  the  samples  in  half  this  pattern  continues  -  different  factors  are  priced  in 
differing  samples. 

The  sorting  schemes  also  have  an  effect  on  the  various  prices.  This  is  an 
interesting  result  given  the  somewhat  consistent  level  of  dispersion  in  betas  across  the 
two  sorting  schemes.  I  make  no  conjectures  about  the  theories  regarding  the  signs  of 
the  various  risk  premiums.  They  have  no  bearing  on  my  research  and  it  is  somewhat 
frivolous  to  discuss  the  signs  since  most  were  insignificantly  different  than  zero. 

Table  6-13  highlights  the  already  mentioned  fundamental  weakness  of  the 
macroeconomic  proxies  -  they  are  simply  not  stable  and  as  such  any  inferences  should 
be  made  with  caution.  Of  course  the  results  in  my  study  are  only  valid  for  my 
definitions  of  the  various  proxies. 

At  first,  the  results  of  the  previous  paragraphs  are  disheartening,  why  do  I 
care  if  the  risk  premium  is  the  same  across  samples  if  it  is  insignificant  in  the  first 
place?  The  answer  is  simple;  this  technique  may  help  screen  out  those  factors  that 
are  significantly  priced  in  some  time-periods  and  not  in  others.  Table  6-14  provides 
the  results  for  this  analysis.  I  conduct  both  a  parametric  and  a  nonparametric  test. 
Unlike  the  independent  samples  of  R2  values  used  in  testing  hypothesis  one,  the  two 
time-series  of  price  estimates  from  the  sub-samples  are  naturally  paired  (since  they 
are  estimated  for  the  same  month).  Therefore,  I  use  a  paired  r-test  and  a  Wilcoxon 
signed  rank  test  for  the  second  hypothesis. 

The  results  are  given  in  Table  6-14.  As  one  might  predict,  it  is  hard  to  reject 
the  equality  of  the  risk  premiums  since  the  standard  error  of  the  estimates  is  large. 
Even  so,  there  are  cases  where  I  reject  the  equality  of  the  two  price  estimates.  In 
some  instances  I  reject  the  equality  across  groups  where  the  risk  premium  was 
significant  in  one  of  the  subsamples.  Looking  at  Panel  A2,  I  notice  that  the  UI  factor 
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was  significantly  priced  for  one  of  the  subsamples  when  I  sorted  by  size.  Yet,  when 
I  compare  it's  prices  across  subsamples  I  reject  the  null  of  equality.  When  sorting  by 
industry  there  is  weak  evidence  of  the  same  phenomena. 

Looking  across  the  various  models  and  time  periods,  a  few  patterns  emerge.53 
For  example,  the  equality  of  URP's  risk  premium  across  samples  is  rejected  in 
roughly  one-half  of  the  cases  and  I  rarely  reject  the  equality  of  DEI's  risk  premium. 
The  evidence  presented  suggests  that  even  when  a  factor  is  significant,  it's  price  may 
not  be  the  same  across  subsamples  and  therefore  should  not  be  considered  a  pervasive 
factor. 


530nly  a  few  of  the  cases  are  presented  in  Table  6-14.  I  ran  the  tests  for  all 
models  and  time  periods.  The  results  are  similar  with  the  exception  of  the  1986-1990 
time  period  (which  are  not  reported). 


CHAPTER  7 


SUMMARY  AND  CONCLUSIONS 

This  dissertation  develops  and  empirically  tests  two  techniques  for  examining 
the  robustness  of  factors  in  an  APT  context.  The  underlying  logic  is  simple  -  if  the 
APT  holds  all  securities  will  be  affected  by  the  same  factors  and  the  prices  of  the 
factors  will  be  the  same  for  any  security.  Therefore  if  a  large  portfolio  of  stocks  is 
divided  into  two  groups  the  factors  and  prices  estimated  from  the  groups  should  be 
the  same.  Brown  and  Weinstein  (1983)  apply  a  similar  logic  in  their  test  of  the  APT. 
My  methodology,  however,  offers  many  advantages  over  previous  tests. 

Summarizing  the  results,  I  find  that  the  asymptotic  principal  components 
technique  is  able  to  extract  the  true  factors  and  the  factor  regression  technique  is  able 
to  compare  the  factors  with  little  error.  The  results  are  sensitive  to  sample  size, 
particularly  in  those  models  that  allow  each  security  to  have  a  unique  variance.  The 
empirical  evidence  suggests  that  the  real  world  data  is  not  generated  by  any  of  the 
models  I  compare  it  to  and  there  appears  to  be  only  one  or  two  pervasive  factors. 

The  prices  of  risk  for  the  macroeconomic  variables  are  generally  insignificant.  In 
some  cases  a  proxy  is  priced  in  one  sample  but  the  price  is  not  the  same  across 
subsamples. 

In  order  to  compare  the  factors  I  first  have  to  extract  them  from  the  time- 
series  of  returns  data.  Most  of  the  previous  work  in  this  area  has  relied  on  factor 
analysis  and  a  small  number  of  securities  in  each  portfolio.  I  rely  on  Connor  and 
Korajczyk's  (1986,  1988)  asymptotic  principal  component  technique  which  allows  me 
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to  use  a  large  number  of  securities  in  my  analysis.  Given  the  asymptotic  nature  of 
the  APT  it  is  imperative  to  use  a  large  sample. 

I  develop  a  new  methodology,  again  based  on  the  techniques  of  Connor  and 
Korajczyk,  for  directly  comparing  the  factors  extracted  from  different  portfolios.  To 
my  knowledge  no  one  has  ever  used  this  methodology  to  compare  factors  extracted 
from  different  portfolios  of  securities.  In  fact,  most  of  the  previous  studies  have  not 
actually  compared  the  factors  but  rather  some  by-product  or  implication  of  the 
factors.  For  example,  Brown  and  Weinstein  (1983)  compare  the  estimates  of  error 
variances  obtained  when  the  factors  are  extracted  from  subsets  to  those  when  they  are 
extracted  from  the  entire  group.  Although  a  common  set  of  factors  would  pass  a  test 
like  this,  so  would  many  other  sets  of  factors  that  were  totally  different  -  as  long  as 
they  explained  a  similar  portion  of  the  cross-sectional  variance.  My  methodology 
compares  the  factors  directly  and  this  is  a  critical  advantage. 

Another  advantage  of  the  factor  comparison  technique  is  that  it  is  a  test  of  the 
primary  assumption  of  the  APT.  As  such,  it  is  not  subject  to  some  of  the  criticisms 
associated  with  tests  of  the  APT's  pricing  relationship.  Since  the  APT  is  only  an 
approximation,  any  empirical  test  involves  additional  assumptions  in  order  to  yield  a 
strict  equilibrium  model.  A  common  assumption  is  that  there  is  some  portfolio  that  is 
mean- variance  efficient  relative  to  the  set  of  factors  chosen.  These  assumptions 
complicate  the  tests  because  the  tests  then  become  a  joint  test  of  the  APT  and  the 
additional  assumptions. 

For  comparing  the  prices  I  use  macroeconomic  proxies  for  the  factors. 
Numerous  researchers  have  used  proxies  such  as  the  ones  I  use  for  a  variety  of  tests 
involving  the  APT.  Several  papers  have  compared  risk  premiums  estimated  across 
different  time  periods.  No  one  has  compared  the  prices  from  two  samples  over  the 
same  time  period.  This  is  a  much  cleaner  test  of  the  APT  as  one  may  reasonably 
expect  the  factors  and  their  prices  to  vary  over  time. 
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For  the  factor  comparison  section  I  develop  five  models  that  generate  security 
returns  under  varying  assumptions.  Two  of  the  models  are  essentially  those  used  by 
Connor  and  Korajczyk  (1988,  1993).  The  other  three  are  extensions  of  their  work 
and  offer  their  own  contributions.  Previous  models  have  assumed  either  a  strict 
factor  structure  or  an  economy  in  which  the  idiosyncratic  return  of  every  security  is 
correlated  (across  securities  but  not  intertemporally).  I  develop  a  model  which 
generates  the  idiosyncratic  return  based  on  the  sample's  covariance  matrix.  If  the 
returns  are  generated  by  an  approximate  factor  structure,  as  the  asymptotic  principal 
components  technique  assumes,  this  is  an  important  feature  of  my  work. 

Next,  I  demonstrate  that  the  asymptotic  principal  components  method  and  the 
factor  regression  method  I  use  to  compare  factors  are  robust  in  their  ability  relative  to 
the  five  models.  I  do  this  by  comparing  the  extracted  factors  to  the  true  factors  for  a 
variety  of  sample  sizes.  These  results  are  interesting  in  their  own  right  as  they 
clearly  demonstrate  that  comparing  factors  based  on  small  portfolios  is  virtually 
meaningless.  Another  important  result  from  these  tests  is  that  the  rotational 
indeterminacy  problem  associated  with  the  asymptotic  principal  components 
techniques  does  not  affect  my  methodologies  ability  to  identify  them  as  pervasive 
factors. 

Then,  using  the  factor  regression  technique,  I  compare  the  R2  values  obtained 
from  subsets  of  simulated  data  to  those  obtained  from  subsets  of  the  real  world  data. 
The  results  indicate  that  the  returns  from  the  real  world  data  are  clearly  not  generated 
by  a  five  factor  model  where  the  five  factors  are  equally  important  and  have  equal 
idiosyncratic  variances.  This  is  the  model  used  by  Connor  and  Korajczyk  (1993)  to 
test  their  methodology  for  identifying  the  number  of  statistically  priced  factors.  As 
such,  one  has  to  question  if  their  technique  is  valid  on  real  world  data. 

More  importantly,  I  conclude  that  the  returns  data  is  characterized  by  at  most 
two,  and  maybe  only  one,  pervasive  factor.  Using  a  model  that  simulates  returns 
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based  on  the  five  most  dominant  factors  in  the  real  world  data,  I  find  that  the  E? 
values  obtained  for  the  first  and  second  factor  are  insignificantly  different  than  those 
from  the  real  data  for  the  first  and  second  factor.  The  third  and  fourth  factor  from 
the  simulated  returns  have  significantly  higher  R2  values  than  the  real  world  data.54 
Previous  researchers  have  suggested  that  similar  findings  are  due  to  the  rotational 
indeterminacy  of  the  techniques.  My  simulation  results  discussed  above  should 
partially  alleviate  these  fears. 

The  results  for  the  equality  of  prices  across  subsamples  are  not  as  strong  as 
for  the  factor  comparisons,  but  still  offer  some  contributions.  As  documented  by 
numerous  other  researchers,  I  find  that  the  prices  are  very  unstable  and  for  the  most 
part  insignificant.  The  prices  differ  across  time  periods  and  across  models.  This 
variation  is  somewhat  understandable.  Changing  economic  conditions  may  change  the 
factor  structure  and  the  correlation  of  the  proxies  will  impact  all  the  prices  as  various 
proxies  are  added  to  or  removed  from  the  model. 

The  difference  across  subsamples  for  the  same  time  period  and  using  the  exact 
same  model,  which  is  a  unique  aspect  of  my  research,  is  somewhat  disturbing.  There 
are  two  possible  explanations  and  I  think  both  of  them  contribute  to  my  findings. 

First,  returns  data  for  securities,  even  after  grouping  them  into  portfolios,  are  very 
noisy.  Second,  there  are  thousands  of  different  potential  macroeconomic  proxies  and 
even  after  choosing  the  "correct"  proxy,  one  has  to  specify  the  innovation  in  that 
variable  and  how  to  align  the  time-series  with  the  returns  data.  If  there  are  only  one 
or  two  factors,  as  the  previous  sections  suggest,  the  choice  of  identifying  the  right 
one  is  seemingly  impossible! 

Nonetheless,  the  methodology  I  presented  offers  another  technique  to  screen 
potential  proxies  and  it  allows  one  to  make  inferences  using  the  entire  sample  period. 

54The  R2  value  for  the  fifth  factor  is  also  higher.  Due  to  large  standard  errors  for 
it  and  the  R2  value  from  the  actual  data's  fifth  factor  the  difference  is  insignificant. 
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In  other  words,  if  a  factor  is  found  to  be  significant  over  some  time  period  this  test  is 
another  way  to  identify  if  it  is  indeed  a  pervasive  factor. 

As  the  previous  paragraphs  indicate,  the  results  of  this  dissertation  already 
offer  several  significant  contributions  to  the  field  of  finance.  However,  there  are 
several  theoretical  and  empirical  areas  that  warrant  further  investigation.  These 
suggested  recommendations  are  discussed  in  the  following  paragraphs. 

It  appears  that,  especially  for  the  real  world  data,  the  R2  values  for  the  first 
and  second  factor  are  very  polar  -  some  are  around  0.8  to  1.0  and  others  are  around 
0.0  to  0.2  and  not  very  many  fall  in  between.  One  explanation  is  that  a  security  in 
the  sample  experienced  a  stream  of  large  and  unusual  idiosyncratic  shocks  and  that 
security  returns  dominate  the  true  factor  or  factors.  In  the  extreme  a  "Heywood" 
case  occurs  when  one  of  the  factors  is  near  perfectly  correlated  with  the  returns  of 
one  security.  Some  researchers  have  thrown  out  these  cases  (an  arguably  dangerous 
approach).  Another  idea  would  be  to  increase  the  sample  size  to  further  "wash  out" 
the  idiosyncratic  risk  and  reduce  the  chance  of  that  security  being  sampled  over  the 
monte-carlo  runs. 

There  might  be  some  way  to  scale  each  asset's  returns  (both  systematic  and 
unsystematic)  without  affecting  the  factor  structure.  One  idea,  analogous  to  weighted 
least  squares,  is  to  divide  each  security  return  by  the  estimated  standard  deviation  of 
it's  error  terms.  By  decreasing  the  idiosyncratic  shocks,  the  true  factor  structure  may 
be  easier  to  identify. 

The  number  of  possible  factor  structures  to  explore  is  limitless  and  different 
models  might  offer  more  insight.  Two  suggested  alternatives  are:  1)  to  look  at  a  one 
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or  two  factor  model,  and  2)  to  increase  the  number  of  securities  in  my  fifth  model 
which  uses  the  full  sample  covariance  matrix.55 

For  the  pricing  comparisons  most  of  the  enhancements  would  involve  some 
advanced  econometric  technique.  For  example,  weighted  least  squares,  seemingly 
unrelated  regressions,  or  Shanken's  (1992)  EIV  correction  could  be  applied.  Even 
with  these  techniques  many  researchers  have  found  few  priced  factors  and  price 
instability  across  groups.  The  use  of  macroeconomic  proxies  will  always  be  popular 
because  of  their  intuitive  appeal.  The  proxies  are  valuable  in  helping  explain  what 
affected  certain  securities  at  certain  times.  I  think  the  future  payoff  is  in  the  work  of 
those  that  are  comparing  the  proxies  to  extracted  factors. 


55This  is  a  computationally  time  consuming  task.  It  involves  estimating  a  full 
n  x  n  covariance  matrix  and  then  each  random  draw  involves  hundreds  of  thousands 
of  calculations. 
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Time-Series  Parameters  for  Macroeconomic  Proxies 
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Note:  Panel  E  does  not  have  results  for  the  two  five  year  intervals  because  Model  #5  requires  that  the  number  of  assets  be  less  than 
the  number  of  observations  in  the  time-series  of  returns.  Therefore,  I  only  look  at  the  ten  year  period  so  that  sample  size  can  be 
reasonably  large. 


Comparing  Extracted  Factors  to  True  Factors  Using  Small  Portfolio  (n  =  50) 
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Note:  Panel  D  does  not  have  results  for  the  two  five  year  intervals  because  Model  # 5  requires  that  the  number  of  assets  be  less 
than  the  number  of  observations  in  the  time-series  of  returns.  Therefore,  I  only  look  at  the  ten  year  period  so  that  the  sample  size 
can  be  reasonably  large. 
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Comparing  the  R2  Values  From  Real  Data  and  Simulated  Data  -  ( n  =  150) 
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Comparing  the  R2  Values  From  Real  Data  and  Simulated  Data  -  (n  =  500) 
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r-stat  is  for  Satterthwaite’s  approximate  test  of  equality  with  unequal  variances. 
Vstat  is  for  Wilcoxon  rank  sum  test. 
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Comparing  the  R2  Values  From  Real  Data  and  Simulated  Data  -  (n  =  1500)* 
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TABLE  6-12 

Dispersion  of  the  Beta  Coefficients  in  OLS  Regressions 
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TABLE  6-13 

Pricing  of  the  Macroeconomic  Factors 
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TABLE  6-14 

Comparing  the  Price  of  Factors  Across  Samples 
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r-stat  is  from  a  paired  r-test. 

z-stat  is  from  a  Wilcoxon  signed  rank  test. 

The  risk  premium  was  significant  (at  the  10%  level)  in  the  first  sample. 

The  risk  premium  was  significant  (at  the  10%  level)  in  the  second  sample. 

!  The  risk  premium  was  significant  (at  the  10%  level)  in  the  first  and  second  sample. 
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25  Eigenvalues  from  1986-1995  Sample 


3n|eAU36;a 


FIGURE  6-2 

First  25  Eigenvalues  from  1986-1990  Sample 
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This  table  provides  the  industry  groupings  as  defined  in  Fama  and  French  (1997).  The 
first  column  reports  the  number  of  firms  in  each  industry  from  my  entire  sample  of 
1680  securities. 


# 

Name 

Industry 

SIC  Codes 

8 

Agric 

Agriculture 

0100-0799,  2048 

36 

Food 

Food  Products 

2000-2046,  2050-2063,  2070-2079, 
2090-2095,  2098-2099 

6 

Soda 

Candy  and  Soda 

2064-2068,  2086-2087,  2096-2097 

7 

Beer 

Alcoholic  Beverages 

2080-2085 

4 

Smoke 

Tobacco  Products 

2100-2199 

18 

Toys 

Recreational  Products 

0900-0999,  3650-3652,  3732, 
3930-3949 

6 

Fun 

Entertainment 

7800-7841,  7900-7999 

23 

Books 

Printing  and  Publishing 

2700-2749,  2770-2799 

49 

Hshld 

Consumer  Goods 

2047,  2391-2392,  2510-2519, 

2590-2599,  2840-2844,  3160-3199, 
3229-3231,  3260,  3262-3263,  3269, 
3630,  3630-3639,  3750-3751,  3800, 
3860-3879,  3910-3919,  3960-3961, 
3991,  3995 


28 

Clths 

Apparel 

2300-2390,  3020-3021,  3100-3111, 
3130-3159,  3965 

9 

Hlth 

Healthcare 

8000-8099 

28 

MedEq 

Medical  Equipment 

3693,  3840-3851 

26 

Drugs 

Pharmaceutical  Products 

2830-2836 

53 

Chems 

Chemicals 

2800-2829,  2850-2899 

19 

Rubbr 

Rubber  and  Plastic  Products 

3000,  3050-3099 

15 

Txtls 

Textiles 

2200-2295,  2297-2299,  2393-2395, 
2397-2399 

77 

BldMt 

Construction  Materials 

0800-0899,  2400-2439,  2450-2459, 

2490-2499,  2950-2952,  3200-3219, 
3240-3259,  3261,  3264,  3270-3299, 
3420-3442,  3446-3452,  3490-3499, 
3996 


16 

Cnstr 

Construction 

1500-1549,  1600-1699,  1700-1799 

31 

Steel 

Steel  Works,  etc. 

3300-3369,  3390-3399 

12 

FabPr 

Fabricated  Products 

3400,  3443-3444,  3460-3479 

84 

Mach 

Machinery 

3510-3536,  3540-3569,  3580-3599 

32 

ElcEq 

Electrical  Equipment 

3600-3621,  3623-3629,  3640-3646, 

3648-3649,  3660,  3691-3692,  3699 


# 

Name 

Industry 

SIC  Codes 

11 

Misc 

Miscellaneous 

3900,  3990,  3999,  9900-9999 

33 

Autos 

Automobiles  and  Trucks 

2296,  2396,  3010-3011,  3537,  3647, 
3694,  3700-3716,  3790-3792,  3799 

12 

Aero 

Aircraft 

3720-3729 

4 

Ships 

Shipbuilding,  Railroad  Equip 

3730-3731,  3740-3743 

4 

Guns 

Defense 

3480-3489,  3760-3769,  3795 

6 

Gold 

Precious  Metals 

1040-1049 

16 

Mines 

Nonmetallic  Mining 

1000-1039,  1060-1099,  1400-1499 

6 

Coal 

Coal 

1200-1299 

94 

Energy 

Petroleum  and  Natural  Gas 

1310-1389,  2900-2911,  2990-2999 

153 

Util 

Utilities 

4900-4999 

16 

Telcm 

T  elecommunications 

4800-4899 

9 

PerSv 

Personal  Services 

7020-7021,  7030-7039,  7200-7212, 

7215-7299,  7395,  7500,  7520-7549, 
7600-7699,  8100-8199,  8200-8299, 
8300-8399,  8400-8499,  8600-8699, 
8800-8899 


87 

BusSv 

Business  Services 

2750-2759,  3993,  7300-7372, 
7374-7394,  7397,  7399,  7510-7519, 
8700-8748,  8900-8999 

35 

Comps 

Computers 

3570-3579,  3680-3689,  3695,  7373 

93 

Chips 

Electronic  Equipment 

3622,  3661-3679,  3810,  3812 

33 

LabEq 

Measuring  and  Control  Equip 

3811,  3820-3830 

31 

Paper 

Business  Supplies 

2520-2549,  2600-2639,  2670-2699, 
2760-2761,  3950-3955 

14 

Boxes 

Shipping  Containers 

2440-2449,  2640-2659,  3210-3221, 
3410-3412 

34 

Trans 

Transportation 

4000-4099,  4100-4199,  4200-4299, 
4400-4499,  4500-4599,  4600-4699, 
4700-4799 

54 

Whlsl 

Wholesale 

5000-5099,  5100-5199 

64 

Rtail 

Retail 

5200-5299,  5300-5399,  5400-5499, 
5500-5599,  5600-5699,  5700-5736, 
5900-5999 

29 

Meals 

Restaurants,  Hotel,  Motel 

5800-5813,  5890,  7000-7019, 
7040-7049,  7213 

43 

Banks 

Banking 

6000-6099,  6100-6199 

47 

Insur 

Insurance 

6300-6399,  6400-6411 

14 

RIEst 

Real  Estate 

6500-6553 

151 

Fin 

Trading 

6200-6299,  6700-6799 

